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NON-ENDOGENOUS, CONSTITUTIVELY ACTIVATED 
HUMAN SEROTONIN RECEPTORS AND SMALL 
MOLECULE MODUALTORS THEREOF 

5 The benefit of U.S. Serial Number 09/060,1 88, filed April 14, 1998 (owned by 

Arena Pharmaceuticals, Inc.) and U.S. Provisional Number 60/090,783, filed June 26, 1998 
(owned by Arena Pharmaceuticals), U.S. Provisional Number 60/1 12,909, filed December 
18, 1998, and U.S. Provisional Number 60/123,000 filed March 5, 1999 is hereby claimed. 

10 FIELD OF THE INVENTION 

The present invention relates to non-endogenous, constitutively active serotonin 
receptors and small molecule modulators thereof. 

BACKGROUND OF THE INVENTION 

15 

I. G protein-coupled receptors 

G protein-coupled receptors share a common structural motif. All these receptors have 
seven sequences of between 22 to 24 hydrophobic amino acids that form seven alpha helices, 
each of which spans the membrane. The transmembrane helices are joined by strands of amino 
20 acids having a larger loop between the fourth and fifth transmembrane helix on the 
extracellular side of the membrane. Another larger loop, composed primarily of hydrophilic 
amino acids, joins transmembrane helices five and six on the intracellular side of the 
membrane. The carboxy terminus of the receptor lies intracellularly with the amino terminus in 
the extracellular space. It is thought that the loop joining helices five and six, as well as, the 
25 carboxy terminus, interact with the G protein. Currently, Gq, Gs, Gi, and Go are G proteins 
that have been identified. The general structure of G protein-coupled receptors is shown in 
Figure 1. 

Under physiological conditions, G protein-coupled receptors exist in the cell membrane 
in equilibrium between two different states or conformations: an “inactive” state and an 
30 “active” state. As shown schematically in Figure 2, a receptor in an inactive state is unable to 
link to the intracellular transduction pathway to produce a biological response. Changing the 
receptor conformation to the active state allows linkage to the transduction pathway and 
produces a biological response. 
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A receptor may be stabilized in an active state by an endogenous ligand or an 
exogenous agonist ligand. Recent discoveries such as, including but not exclusively limited to, 
modifications to the amino acid sequence of the receptor provide means other than ligands to 
stabilize the active state conformation. These means effectively stabilize the receptor in an 
5 active state by simulating the effect of a ligand binding to the receptor. Stabilization by such 
ligand-independent means is termed “constitutive receptor activation.” 

II. Serotonin receptors 

Receptors for serotonin (5-hydroxytryptamine, 5-HT) are an important class of G 
protein-coupled receptors. Serotonin is thought to play a role in processes related to learning 
10 and memory, sleep, thermoregulation, mood, motor activity, pain, sexual and aggressive 
behaviors, appetite, neurodegenerative regulation, and biological rhythms. Not surprisingly, 
serotonin is linked to pathophysiological conditions such as anxiety, depression, obsessive- 
compulsive disorders, schizophrenia, suicide, autism, migraine, emesis, alcoholism and 
neurodegenerative disorders. 

15 Serotonin receptors are divided into seven subfamilies, referred to as 5-HT1 through 5- 

HT7, inclusive. These subfamilies are further divided into subtypes. For example, the 5-HT2 
subfamily is divided into three receptor subtypes: 5-HT2A, 5-HT2B, and 5-HT2C. The human 
5-HT2C receptor was first isolated and cloned in 1987, and the human 5-HT2A receptor was 
first isolated and cloned in 1990. These two receptors are thought to be the site of action of 
20 hallucinogenic drugs. Additionally, antagonists to the 5-HT2A and 5-HT2C receptors are 
believed to be useful in treating depression, anxiety, psychosis and eating disorders. 

U.S. Patent Number 4,985,352, describes the isolation, characterization, and expression 
of a functional cDNA clone encoding the entire human 5-HT1C receptor (now known as the 
5HT2C receptor). U.S. Patent Number 5,661,0124 describes the isolation, characterization, 
25 and expression of a functional cDNA clone encoding the entire human 5-HT2A receptor. 

Mutations of the endogenous forms of the rat 5-HT2A and rat 5-HT2C receptors have 
been reported to lead to constitutive activation of these receptors (5-HT2A: Casey, C. et al. 
( 1 996) Society for Neuroscience Abstracts, 22:699.10, hereinafter “Casey”; 5-HT2C: Herrick- 
Davis, K., and Teitler, M. (1996) Society for Neuroscience Abstracts, 22:699.18, hereinafter 
30 “Herrick-Davis 1”; and Herrick-Davis, K. et al. (1997) J Neurochemistry 69(3): 1138, 
hereinafter “Hemck-Davis-2”). Casey describes a mutation of the cysteine residue at position 
322 of the rat 5-HT2A receptor to lysine (C322K), glutamine (C322Q) and arginine (C322R) 
which reportedly led to constitutive activation. Herrick-Davis 1 and Herrick-Davis 2 describe 
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mutations of the serine residue at position 312 of the rat 5-HT2C receptor to phenylalanine 
(S312F) and lysine (S312K), which reportedly led to constitutive activation. 

SUMMARY OF THE INVENTION 

5 The present invention relates to non-endogenous, constitutively activated forms of 

the human 5-HT2A and human 5-HT2C receptors and various uses of such receptors. 
Further disclosed are small molecule modulators of these receptors. Most preferably, these 
modulators have inverse agonist characteristics at the receptor. 

More specifically, the present invention discloses nucleic acid molecules and the 
10 proteins for three non-endogenous, constitutively activated human serotonin receptors, referred 
to herein as, AP-1, AP-3, and AP-4. The AP-1 receptor is a constitutively active form of the 
human 5-HT2C receptor created by an S310K point mutation. The AP-3 receptor is a 
constitutively active form of the human 5-HT2A receptor whereby the intracellular loop 3 
(IC3) portion and the cytoplasmic-tail portion of the endogenous human 5-HT2A receptor have 
15 been replaced with the IC3 portion and the cytoplasmic-tail portion of the human 5-HT2C 
receptor. The AP-4 receptor is a constitutively active form of the human 5-HT2A receptor 
whereby (1) the region of the intracellular third loop between the proline of the transmembrane 
5 region (TM5) and the proline of TM6 of the endogenous human 5-HT2A receptor has been 
replaced with the corresponding region of the human 5-HT2C receptor (including a S310K 
20 point mutation); and (2) the cytoplasmic-tail portion of the endogenous human 5-HT2A 
receptor has been replaced with the cytoplasmic-tail portion of the endogenous human 5-HT2C 
receptor. 

The invention also provides assays that may be used to directly identify candidate 
compounds as agonists, partial agonists or inverse agonists to non-endogenous, constitutively 
25 activated human serotonin receptors; such candidate compounds can then be utilized in 
pharmaceutical composition^) for treatment of diseases and disorders which are related to the 
human 5-HT2A and/or human 5-HT2C receptors. 

These and other aspects of the invention disclosed herein will be set forth in greater 
detail as the patent disclosure proceeds. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

In the following figures, bold typeface indicates the location of the mutation in the non- 
endogenous, constitutively activated receptor relative to the corresponding endogenous 
receptor. 
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Figure 1 shows a generalized structure of a G protein-coupled receptor with the 
numbers assigned to the transmembrane helices, the intracellular loops, and the extracellular 
loops. 

Figure 2 schematically shows the active and inactive states for a typical G protein- 
5 coupled receptor and the linkage of the active state to the second messenger transduction 
pathway. 

Figure 3a provides the nucleic acid sequence of the endogenous human 5-HT2A 
receptor (SEQ.ID.NO: 24). 

Figure 3b provides the corresponding amino acid sequence of the endogenous human 
1 0 5-HT2A receptor (SEQ.ID.NO: 25). 

Figure 4a provides the nucleic acid sequence of the endogenous human 5-HT2C 
receptor (SEQ.ID.NO: 26). 

Figure 4b provides the corresponding amino acid sequence of the endogenous human 
5-HT2C receptor (SEQ.ID.NO: 27). 

15 Figure 5a provides the nucleic acid sequence of a constitutively active form of the 

human 5-HT2C receptor (“AP-1 cDNA” - SEQ.ID.NO: 28). 

Figure 5b provides the corresponding amino acid sequence of the AP-1 cDNA (“AP- 
1” - SEQ.ID.NO: 29). 

Figure 6a provides the nucleic acid sequence of a constitutively active form of the 
20 human 5-HT2A receptor whereby the IC3 portion and the cytoplasmic-tail portion of the 
endogenous 5-HT2A receptor have been replaced with the IC3 portion and the cytoplasmic-tail 
portion of the human 5-HT2C receptor (“AP-3 cDNA” - SEQ.ID.NO: 30). 

Figure 6b provides the corresponding amino acid sequence of the AP-3 cDNA (“AP- 
3”- SEQ.ID.NO: 31). 

25 Figure 6c provides a schematic representation of AP-3, where the dashed-lines 

represent the portion obtained from the human 5-HT2C receptor. 

Figure 7a provides the nucleic acid sequence of a constitutively active form of the 
human 5-HT2A receptor whereby (1) the region of the between the proline of TM5 and the 
proline of TM6 of the endogenous human 5-HT2A receptor has been replaced with the 
30 corresponding region of the human 5-HT2C receptor (including a S3 1 OK point mutation); and 
(2) the cytoplasmic-tail portion of the endogenous 5-HT2A receptor has been replaced with the 
cytoplasmic-tail portion of the endogenous human 5-HT2C receptor (“AP-4 cDNA” - 
SEQ.ID.NO:32). 
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Figure 7b provides the corresponding amino acid sequence of the AP-4 cDNA (“AP- 
4” - SEQ.ID.NO: 33). 

Figure 7c provides a schematic representation of the mutated 5-HT2A receptor of 
Figure 7b where the dashed-lines represent the portion obtained from the human 5-HT2C 
5 receptor. 

Figure 8 is a representation of the preferred vector, pCMV , used herein. 

Figure 9 is a diagram illustrating (1) enhanced [ 35 S]GTPyS binding to membranes 
prepared from COS cells expressing the endogenous human 5-HT2C receptor in response to 
serotonin, and (2) inhibition by mianserin using wheatgerm agglutinin scintillation proximity 
io beads. The concentration of [ 35 S]GTPyS was held constant at 0.3 nM, and the concentration of 
GDP was held at 1 pM. The concentration of the membrane protein wets 12.5 pg. 

Figure 10 is a diagram showing serotonin stimulation of [ 35 S]GTPyS binding to 
membranes expressing AP-1 receptors in 293T cells and the inhibition by 30 pM mianserin on 
Wallac™ scintistrips. 

15 Figure 11 is a diagram showing the effects of protein concentration on [ 35 S]GTPyS 

binding in membranes prepared from 293T cells transfected with the endogenous human 5- 
HT2C receptors and AP-1 receptors compared to cells transfected with the control vector 
(pCMV) alone in the absence (A) and presence (B) of 10 pM serotonin. The radiolableled 
concentration of [ 35 S]GTPyS was held constant at 0.3 nM, and the GDP concentration was held 
20 constant at 1 pM. The assay was performed on 96-well format on Wallac™ scintistrips. 

Figure 12 provides bar-graph comparisons of inositol trisphosphate (“IP3”) 
production between the endogenous human 5HT2A receptor and AP-2, a mutated form of 
the receptor. 

Figure 13 provides bar-graph comparisons of inositol trisphosphate (“IP3”) 
25 production between the endogenous human 5HT2A receptor and AP-4, a mutated form of 
the receptor. 

Figure 14 provides bar graph comparisons of IP3 production between the endogenous 
human 5-HT2A receptor and AP-3, a mutated form of the receptor. 

Figure 15 provides bar-graph comparisons of IP3 production between the endogenous 
30 human 5-HT2C receptor and AP-1 . 

Figures 16A-C provides representative auoradiograms showing displacement of I 125 - 
LSD from brain sections by spiperone and compound 116100. 

Figure 17 shows in vivo response of animals to 116102 exposure. 
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DEFINITIONS 

The scientific literature that has evolved around receptors has adopted a number of 
terms to refer to ligands having various effects on receptors. For clarity and consistency, the 
following definitions will be used throughout this patent document. To the extent that these 
definitions conflict with other definitions for these terms, the following definitions shall 
control. 

AGONISTS shall mean moieties that activate the intracellular response when they 
bind to the receptor, or enhance GTP binding to membranes. 

AMINO ACID ABBREVIATIONS used herein are set out in Table 1 : 


TABLE 1 


ALANINE 

ALA 

A 

ARGININE 

ARG 

R 

ASPARAGINE 

ASN 

.... 

N 

i - _ _ 

ASPARTIC ACID 

ASP 

D 

f 

CYSTEINE 

CYS 

C 

GLUTAMIC ACID 

GLU 

E 

GLUTAMINE 

GLN 

Q 

GLYCINE 

GLY 

G 

HISTIDINE 

HIS 

H 

ISOLEUCINE 

ILE 

I 

LEUCINE 

LEU 

L 

LYSINE 

LYS 

K 

METHIONINE 

MET 

M 

PHENYLALANINE 

PHE 

F 

PROLINE 

PRO 

P 
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SERINE 

SER 

S 

THREONINE 

THR 

T 

TRYPTOPHAN 

TRP 

W 

TYROSINE 

TYR 

Y 

VALINE 

VAL 

V 


PARTIAL AGONISTS shall mean moieties which activate the intracellular response 
when they bind to the receptor to a lesser degree/extent than do agonists, or enhance GTP 
binding to membranes to a lesser degree/extent than do agonists. 

5 ANTAGONIST shall mean moieties that competitively bind to the receptor at the 

same site as the agonists but which do not activate the intracellular response initiated by the 
active form of the receptor, and can thereby inhibit the intracellular responses by agonists or 
partial agonists. ANTAGONISTS do not diminish the baseline intracellular response in the 
absence of an agonist or partial agonist. 

10 CANDIDATE COMPOUND shall mean a molecule (for example, and not limitation, 

a chemical compound) which is amenable to a screening technique. 

COMPOUND EFFICACY shall mean a measurement of the ability of a compound to 
inhibit or stimulate receptor functionality, as opposed to receptor binding affinity. 

CONSTITUTIVELY ACTIVATED RECEPTOR shall mean a receptor subject to 
1 5 constitutive receptor activation. 

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a 
receptor in the active state by means other than binding of the receptor with its endogenous 
ligand or a chemical equivalent thereof. 

CONTACT or CONTACTING shall mean bringing at least two moieties together, 
20 whether in an in vitro system or an in vivo system. 

ENDOGENOUS shall mean a material that a mammal naturally produces. 
ENDOGENOUS in reference to, for example and not limitation, the term “receptor” shall 
mean that which is naturally produced by a mammal (for example, and not limitation, a 
human) or a virus. 

25 In contrast, the term NON-ENDOGENOUS in this context shall mean that which is 

not naturally produced by a mammal (for example, and not limitation, a human) or a virus. For 
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example, and not limitation, a receptor which is not constitutively active in its endogenous 
form, but when manipulated becomes constitutively active, is most preferably referred to 
herein as a “non-endogenous, constitutively activated receptor.” Both terms can be utilized to 
describe both “in vivo” and “in vitro” systems. For example, and not a limitation, in a 
5 screening approach, the endogenous or non-endogenous receptor may be in reference to an in 
vitro screening system. As a further example and not limitation, where the genome of a 
mammal has been manipulated to include a non-endogenous constitutively activated receptor, 
screening of a candidate compound by means of an in vivo system is viable. 

INHIBIT or INHIBITING, in relationship to the term “response” shall mean that a 
1 0 response is decreased or prevented in the presence of a compound as opposed to in the absence 
of the compound. 

INVERSE AGONISTS shall mean moieties that bind the endogenous form of the 
receptor or to the constitutively activated form of the receptor, and which inhibit the baseline 
intracellular response initiated by the active form of the receptor below the normal base level 
15 of activity which is observed in the absence of agonists or partial agonists, or decrease GTP 
binding to membranes. Preferably, the baseline intracellular response is inhibited in the 
presence of the inverse agonist by at least 30%, more preferably by at least 50%, and most 
preferably by at least 75%, as compared with the baseline response in the absence of the 
inverse agonist. 

20 LIGAND shall mean an endogenous, naturally occurring molecule specific for an 

endogenous, naturally occurring receptor. 

PHARMACEUTICAL COMPOSITION shall mean a composition comprising at 
least one active ingredient, whereby the composition is amenable to investigation for a 
specified, efficacious outcome in a mammal (for example, and not limitation, a human). Those 
25 of ordinary skill in the art will understand and appreciate the techniques appropriate for 
determining whether an active ingredient has a desired efficacious outcome based upon the 
needs of the artisan. 

STIMULATE or STIMULATING, in relationship to the term “response” shall mean 
that a response is increased in the presence of a compound as opposed to in the absence of the 
30 compound. 

DETAILED DESCRIPTION 
I. Particularly preferred mutations 
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For convenience, the sequence information regarding the non-endogenous, 
constitutively active human 5-HT2A and 5-HT2C receptors are referred to by identifiers as set 
forth in Table 2: 


TABLE 2 


IDENTIFIER 

RECEPTOR 

SEQ.ID.NO: 

FIGURE 

AP-1 cDNA 

5-HT2C 

28 

5a 

AP-1 

5-HT2C 

29 

5b 

AP-3 cDNA 

5-HT2A 

30 

6a 

AP-3 

5-HT2A 

31 

6b 

AP-4 cDNA 

5-HT2A 

32 

7a 

AP-4 

5-HT2A 

33 

7b 


5 As will be discussed in greater detail below, a mutation analogous to that reported by Casey 
(C322K) was utilized in the human 5-HT2A receptor and is referred to herein as AP-2. 
However, AP-2 did not lead to sufficient constitutive activation to allow for utilization in 
screening techniques. 

II. Introduction 

10 While it is sometimes possible to make predictions as to the effect of nucleic acid 

manipulation from one species to another, this is not always the case. The results reported by 
Casey suggest that a point mutation in the rat 5-HT2A receptor evidences constitutive 
activation of the mutated receptor. Casey reports that the C322K mutation was approximately 
four fold more active than the native rat 5-HT2A receptor. However, for purposes of a most 
15 preferred use, i.e., screening of candidate compounds, this corresponding mutation in the 
human 5-HT2A receptor had little discemable effect in evidencing constitutive activation of 
the human receptor. This, of course, creates the reasonable conclusion that the information 
reported in Herrick-Davis 1 or Herrick-Davis 2 is of limited predictive value relative to the 
manipulation of the human 5-HT2C receptor. Consequently, the ability to make reasonable 
20 predictions about the effects of mutations to the rat 5-HT receptors vis-a-vis the corresponding 
human receptors is not possible. Nonetheless, this unfortunate lack of reasonable predictability 
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provides the opportunity for others to discover mutations to the human 5-HT receptors that 
provide evidence of constitutive activation. 

Therefore, the present invention is based upon the desire of defining mutated 
sequences of the human serotonin receptors 5-HT2A and 5-HT2C whereby such mutated 
5 versions of the expressed receptor are constitutively active. These constitutively active 
receptors allow for, inter alia, screening candidate compounds. 

What has been discovered and disclosed herein is that substantial activation of the 
human 5-HT2A receptor can be obtained by “domain swapping,” i.e., by switching the third 
intracellular domain of the 5-HT2A receptor with the third intracellular domain of the 5- 
10 HT2C receptor. Additionally, swapping the cytoplasmic tail of the two receptors further 
increases the IP3 response. Furthermore, mutation of the serine at position 310 to lysine 
(S3 1 OK) of the human 5-HT2C receptor leads to constitutive activation. 

What follows is a most preferred approach to identification of candidate compounds; 
those in the art will readily appreciate that the particular order of screening approaches, 
15 and/or whether or not to utilize certain of these approaches, is a matter of choice. Thus, the 
order presented below, set for presentational efficiency and for indication of the most 
preferred approach utilized in screening candidate compounds, is not intended, nor is to be 
construed, as a limitation on the disclosure, or any claims to follow. 

III. Generic G Protein-Coupled Receptor screening assay techniques 
20 When a G protein receptor becomes constitutively active, it binds to a G protein (Gq, 

Gs, Gi, Go) and stimulates the binding of GTP to the G protein. The G protein then acts as a 
GTPase and slowly hydrolyzes the GTP to GDP, whereby the receptor, under normal 
conditions, becomes deactivated. However, constitutively activated receptors continue to 
exchange GDP to GTP. A non-hydrolyzable analog of GTP, [ 35 S]GTPyS, can be used to 
25 monitor enhanced binding to membranes which express constitutively activated receptors. It is 
reported that [ 35 S]GTPyS can be used to monitor G protein coupling to membranes in the 
absence and presence of ligand. An example of this monitoring, among other examples well- 
known and available to those in the art, was reported by Traynor and Nahorski in 1995. The 
preferred use of this assay system is for initial screening of candidate compounds because the 
30 system is generically applicable to all G protein-coupled receptors regardless of the particular 
G protein that interacts with the intracellular domain of the receptor. 
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IV. Confirmation of G Protein-Coupled Receptor site screening assay techniques 

Once candidate compounds are identified using the “generic” G protein-coupled 
receptor assay (i.e. an assay to select compounds that are agonists, partial agonists, or inverse 
agonists), further screening to confirm that the compounds have interacted at the receptor site 
5 is preferred. For example, a compound identified by the “generic” assay may not bind to the 
receptor, but may instead merely “uncouple” the G protein from the intracellular domain. 
Thus, by further screening those candidate compounds, which have been identified using a 
“generic” assay in an agonist and/or antagonist competitive binding assay, further refinement 
in the selection process is provided. 

1 o Lysergic acid diethylamide (LSD) is a well-known agonist of the 5-HT2A and 5-HT2C 

receptors, while mesulergine is a well-known antagonist to the 5-HT2C receptor. Accordingly, 
in most preferred embodiments, an agonist (LSD) and/or antagonist (mesulergine) competitive 
binding assay(s) is used to further screen those compounds selected from the “generic” assay 
for confirmation of serotonin receptor binding. 

1 5 V. Specified G Protein assay techniques 

The art-accepted physiologically mediated pathway for the human 5-HT2A and 5- 
HT2C receptors is via Gq. Intracellular accumulation of IP3 can be used to confirm 
constitutive activation of these types of Gq coupled receptors (see Herrick-Davis-1). As a 
result, “IP3 accumulation” assays can be used to further screen those compounds selected from 
20 an agonist and/or antagonist competitive binding assay. 

VI. Pharmaceutical compositions 

Candidate compounds selected for further development can be formulated into 
pharmaceutical compositions using techniques well known to those in the art. Suitable 
pharmaceutically-acceptable carriers are available to those in the art; for example, see 
25 Remington’s Pharmaceutical Sciences, 16 th Edition, 1980, Mack Publishing Co., (Oslo et al., 
eds.) 

EXAMPLES 

The following examples are presented for purposes of elucidation, and not 
limitation, of the present invention. While specific nucleic acid and amino acid sequences 
30 are disclosed herein, those of ordinary skill in the art are credited with the ability to make 
minor modifications to these sequences while achieving the same or substantially similar 
results reported below. It is intended that equivalent, non-endogenous, constitutively 
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activated human serotonin receptor sequences having eighty-five percent (85%) homology, 
more preferably having ninety percent (90%) homology, and most preferably having ninety- 
five percent (95%) homology to the disclosed and claimed sequences all fall within the 
scope of any claims appended hereto. 

5 Example 1 

Generation ofNon-Endogenous, Constitutively Activated 
Human Serotonin Receptors 5-HT2C and 5-HT2A 


10 

A. Construction of constitutively active 5-HT2C receptor cDNA 
1. Endogenous Human 5-HT2C 

The cDNA encoding endogenous human 5-HT2C receptor was obtained from 
human brain poly-A + RNA by RT-PCR. The 5’ and 3’ primers were derived from the 5’ 
15 and 3 ’ untranslated regions and contained the following sequences: 

5’-GACCTCGAGGTTGCTTAAGACTGAAGCA-3’ (SEQ.ID.NO:l) 

5 ’-ATTTCTAGACATATGTAGCTTGTACCGT-3 ’ (SEQ.ID.NO:2) 

PCR was performed using either TaqPlus™ precision polymerase (Stratagene) or rTth™ 
polymerase (Perkin Elmer) with the buffer systems provided by the manufacturers, 0.25 pM of 
20 each primer, and 0.2 mM of each of the four (4) nucleotides. The cycle condition was 30 
cycles of 94°C for 1 minute, 57 °C for 1 minute and 72 °C for 2 minutes. The 1 .5 kb PCR 
fragment was digested with Xho I and Xba I and subcloned into the Sal I-Xba I site of 
pBluescript. 

The derived cDNA clones were fully sequenced and found to correspond to 
25 published sequences. 
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2. AP-J cDNA 

The cDNA containing a S310K mutation (AP-1 cDNA) in the third intracellular loop 
of the human 5-HT2C receptor was constructed by replacing the Sty I restriction fragment 
containing amino acid 310 with synthetic double stranded oligonucleotides encoding the 
5 desired mutation. The sense strand sequence utilized had the following sequence: 

5’- 

CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC-3’ 
(SEQ. ID.NO: 3) 

and the antisense strand sequence utilized had the following sequence: 

10 5 ’ -C AAGG ACTTTCTTAGCTTTTCTTTCATTGTTG ATAGCCTGC ATGGT 

GCCC-3’ (SEQ. ID. NO: 4). 

B. Construction of constitutively active 5-HT2A receptor cDNA 

1. Endogenous Human 5-HT2A 

The cDNA encoding endogenous human 5-HT2A receptor was obtained by RT-PCR 
15 using human brain poly-A + RNA; a 5’ primer from the 5’ untranslated region with a Xho I 
restriction site: 

5^GACCTCGAGTCCTTCTACACCTCATC-3’ (SEQ.ID.NO:5) 
and a 3’ primer from the 3’ untranslated region containing an Xba I site: 

S’-TGCTCTAGATTCCAGATAGGTGAAAA CTTG-3’ (SEQ.ID.NO:6). 

20 PCR was performed using either TaqPlus™ precision polymerase (Stratagene) or rTth™ 
polymerase (Perkin Elmer) with the buffer systems provided by the manufacturers, 0.25 pM 
of each primer, and 0.2 mM of each of the four (4) nucleotides. The cycle condition was 30 
cycles of 94°C for 1 minute, 57 °C for 1 minute and 72 °C for 2 minutes. The 1.5 kb PCR 
fragment was digested with Xba 1 and subcloned into the Eco RV-Xba I site of pBluescript. 
25 The resulting cDNA clones were fully sequenced and found to encode two amino 

acid changes from the published sequences. The first change is a T25N mutation in the N- 
terminal extracellular domain and the second change is an H452Y mutation. These 
mutations are likely to represent sequence polymorphisms rather than PCR errors since the 
cDNA clones having the same two mutations were derived from two independent PCR 
30 procedures using Taq polymerase from two different commercial sources (TaqPlus™ 
Stratagene and rTth™ Perkin Elmer). 

2. Human 5-HT2A (C322K;AP-2) 
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The cDNA containing the point mutation C322K in the third intracellular loop was 
constructed by using the Sph I restriction enzyme site, which encompasses amino acid 322. 
For the PCR procedure, a primer containing the C322K mutation: 

5’-CAAAGAAAGTACTGGGCATCGTCTTCTTCCT-3’ (SEQ.ID.NO:7) 

5 was used along with the primer from the 3’ untranslated region set forth above as 
SEQ.ID.NO:6. The resulting PCR fragment was then used to replace the 3’ end of the wild 
type 5-HT2A cDNA by the T4 polymerase blunted Sph I site. PCR was performed using 
pfu polymerase (Stratagene) with the buffer system provided by the manufacturer and 10% 
DMSO, 0.25 mM of each primer, 0.5mM of each of the 4 nucleotides. The cycle conditions 
were 25 cycles of 94°C for 1 minute, 60°C for 1 minute and 72°C for 1 minute. 


10 
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3. AP-3 cDNA 

The human 5-HT2A cDNA with intracellular loop 3 (IC3) or IC3 and cytoplasmic 
tail replaced by the corresponding human 5-HT2C cDNA was constructed using PCR-based 
mutagenesis. 

5 (a) Replacement of IC3 Loop 

The IC3 loop of human 5-HT2A cDNA was first replaced with the corresponding 
human 5-HT2C cDNA. Two separate PCR procedures were performed to generate the two 
fragments, Fragment A and Fragment B, that fuse the 5-HT2C IC3 loop to the 
transmembrane 6 (TM6) of 5-HT2A. The 237 bp PCR fragment. Fragment A, containing 5- 
10 HT2C IC3 and the initial 13 bp of 5-HT2A TM6 was amplified by using the following 
primers: 

5 ’ -CCGCTCGAGTACTGCGCCGAC AAGCTTTG AT -3 ’ (SEQ.ID.NO:8) 

5 ’ -CGAT GCCCAGC ACTTTCGAAGCTTTTCTTT C ATT GTTG3 ’ (SEQ.ID.NO:9) 
The template used was human 5-HT2C cDNA. 

15 The 529 bp PCR fragment. Fragment B, containing the C-terminal 13 bp of IC3 

from 5-HT2C and the C-terminal of 5-HT2A starting at beginning of TM6, was amplified 
by using the following primers: 

5’-AAAAGCTTCGAAAGTGCTGGGCATCGTCTTCTTCCT-3’ (SEQ.ID.NO:10) 

5 ’ -TGCT CTAG ATTCC AGAT AGGTG AAAACTT G-3 ’ (SEQ.1D.NO: 11) 

20 The template used was human 5-HT2A cDNA. 

Second round PCR was performed using Fragment A and Fragment B as co- 
templates with SEQ.ID.NO:8 and SEQ.ID.NO:ll (it is noted that the sequences for 
SEQ.ID.NOS.: 6 and 11 are the same) as primers. The resulting 740 bp PCR fragment, 
Fragment C, contained the IC3 loop of human 5-HT2C fused to TM6 through the end of the 
25 cytoplasmic tail of human 5-HT2A. PCR was performed using pfu™ polymerase 
(Stratagene) with the buffer system provided by the manufacturer, and 10% DMSO, 0.25 mM 
of each primer, and 0.5 mM of each of the four (4) nucleotides. The cycle conditions were 25 
cycles of 94 °C for 1 minute, 57 °C (1st round PCR) or 60 °C (2nd round PCR) for 1 minute, 
and 72 °C for 1 minute (1st round PCR) or 90 seconds. (2nd round PCR). 

30 To generate a PCR fragment containing a fusion junction between the human 5- 

HT2A TM5 and the IC3 loop of 5-HT2C, four (4) primers were used. The two external 
primers, derived from human 5-HT2A, had the following sequences: 

5’-CGTGTCTCTCCTTACTTCA-3’ (SEQ.ID.NO.T2) 



WO 99/52927 


PCT/US99/08168 


16 

The other primer used was SEQ.ID.NO.6 (see note above regarding SEQ.ID.NOS. 6 and 
1 1 ). The first internal primer utilized was an antisense strand containing the initial 1 3 bp of 
IC3 of 5-HT2C followed by the terminal 23 bp derived from TM5 of 5-HT2A: 

5’-TCGGCGCAGTACTTTGATAGTTAGAAAGTAGGTGAT-3 ’ (SEQ.ID.NO: 13) 
5 The second internal primer was a sense strand containing the terminal 14 bp derived 

from TM5 of 5-HT2A followed by the initial 24 bp derived from IC3 of 5-HT2C: 

5’-TTCTAACTATCAAAGTACTGCGCCGACAAGCTTTGATG-3’ 

(SEQ.ID.NO: 14). 

PCR was performed using endogenous human 5-HT2A and a co-template, Fragment 
10 C, in a 50 ml reaction volume containing IX pfu buffer, 10% DMSO, 0.5 mM of each of the 
four (4) nucleotides, 0.25 mM of each external primer (SEQ.ID.NOS. 1 1 and 12), 0.06 mM 
of each internal primer (SEQ.ID.NOS. 13 and 14) and 1.9 units of pfu polymerase 
(Stratagene). The cycle conditions were 25 cycles of 94°C for 1 minute, 52°C for 1 minute 
and 72 °C for 2 minutes and 10 seconds. The 1.3 kb PCR product was then gel purified and 
15 digested with Pst I and Eco RI. The resulting 1 kb Pstl-Eco RI fragment was used to 
replace the corresponding fragment in the endogenous human 5-HT2A sequence to generate 
the mutant 5-HT2A sequence encoding the IC3 loop of 5-HT2C. 

(b) Replacement of the cytoplasmic tail 

To replace the cytoplasmic tail of 5-HT2A with that of 5-HT2C, PCR was 
20 performed using a sense primer containing the C-terminal 22 bp of TM7 of endogenous 
human 5-HT2A followed by the initial 21 bp of the cytoplasmic tail of endogenous human 
5-HT2C: 

5’-TTCAGCAGTCAACCCACTAGTCTATACTCTGTTCAACAAAATT-3’ 
(SEQ.ID.NO: 15) 

25 The antisense primer was derived from the 3’ untranslated region of endogenous human 5- 
HT2C: 

5 ’-ATTTCTAGAC ATATGTAGCTTGTACCGT-3 ’ (SEQ.ID.NO: 1 6). 

The resulting PCR fragment, Fragment D, contained the last 22 bp of endogenous 
human 5-HT2A TM7 fused to the cytoplasmic tail of endogenous human 5-HT2C. Second 
30 round PCR was performed using Fragment D and the co-template was endogenous human 
5-HT2A that was previously digested with Acc I to avoid undesired amplification. The 
antisense primer used was SEQ.lD.NO:16 (the sequences for SEQ.ID.NOS. 16 and 2 are the 
same) and the sense primer used was derived from endogenous human 5-HT2A: 
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5 ’-ATCACCTACTTTCTAACTA-3 ’ (SEQ.ID.NO: 1 7). 

PCR conditions were as set forth in Example lB3.(a) for the first round PCR, except 
that the annealing temperature was 48 °C and the extension time was 90 seconds. The 
resulting 710 bp PCR product was digested with Apa I and Xba I and used to replace the 
5 corresponding Apa I-Xba I fragment of either (a) endogenous human 5-HT2A, or (b) 5- 
HT2A with 2C IC3 to generate (a) endogenous human 5-HT2A with endogenous human 5- 
HT2C cytoplasmic tail and (b) AP-3, respectively. 

4. AP-4 cDNA 

10 This mutant was created by replacement of the region of endogenous human 5- 

HT2A from amino acid 247, the middle of TM5 right after Pro 245 , to amino acid 337, the 
middle of TM6 just before Pro 338 , by the corresponding region of AP-1 cDNA. For 
convenience, the junction in TM5 is referred to as the “2A-2C junction,” and the junction in 
TM6 is referred to as the “2C-2A junction.” 

15 Three PCR fragments containing the desired hybrid junctions were generated. The 

5’ fragment of 561 bp containing the 2A-2C junction in TM5 was generated by PCR using 
endogenous human 5-HT2A as template, SEQ.ID.NO: 12 as the sense primer, and the 
antisense primer was derived from 13 bp of 5-HT2C followed by 20 bp of 5-HT2A 
sequence: 

20 5 ’ -CCATAATCGTCAGGGGAATGAAAAATGACACAA-3 ’ (SEQ.ID.NO: 18) 

The middle fragment of the 323 bp contains endogenous human 5-HT2C sequence 
derived from the middle of TM5 to the middle of TM6, flanked by 13 bp of 5-HT2A 
sequences from the 2A-2C junction and the 2C-2A junction. This middle fragment was 
generated by using AP-1 cDNA as a template, a sense primer containing 13 bp of 5-HT2A 

25 followed by 20 bp of 5-HT2C sequences across the 2A-2C junction and having the 
sequence: 

5 , -ATTTTTCATTCCCCTGACGATTATGGTGATTAC-3 , (SEQ.ID.NO:19); 

and an antisense primer containing 13 bp of 5-HT2A followed by 20 bp of 5-HT2C 
sequences across the 2C-2A junction and having the sequence: 

30 5 ’ -T GAT G AAG AAAGGGC ACC AC AT G ATC AG AAAC A-3 ’ (SEQ.ID.NO:20). 

The 3’ fragment of 487 bp containing the 2C-2A junction was generated by PCR using 
endogenous human 5-HT2A as a template and a sense primer having the following 
sequence from the 2C-2A junction: 
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5 ’-GATC ATGTGGTGCCCTTTCTTCATC AC AAAC AT-3 ’ (SEQ.ID.NO:2 1 ) 
and the antisense primer was SEQ.ID.NO:6 see note above regarding SEQ.ID.NOS. 6 and 
11 ). 

Two second round PCR reactions were performed separately to link the 5’ and 
5 middle fragment (5’M PCR) and the middle and 3’ fragment (M3’ PCR). The 5’M PCR co- 
template used was the 5’ and middle PCR fragment as described above, the sense primer 
was SEQ.ID.NO:12 and the antisense primer was SEQ.ID.NO:20. The 5’M PCR procedure 
resulted in an 857 bp PCR fragment. 

The M3’ PCR used the middle and M3’ PCR fragment described above as the co- 
10 template, SEQ.ID.NO: 19 as the sense primer and SEQ.ID.NO:6 (see note above regarding 
SEQ.ID.NOS. 6 and 11) as the antisense primer, and generated a 784 bp amplification 
product. The final round of PCR was performed using the 857 bp and 784 bp fragments 
from the second round PCR as the co-template, and SEQ.ID.NO:12 and SEQ.ID.NO: 6 (see 
note above regarding SEQ.ID.NOS. 6 and 11) as the sense and the antisense primer, 
15 respectively. The 1 .32 kb amplification product from the final round of PCR was digested 
with Pst I and Eco RI. Then resulting 1 kb Pst I-Eco RI fragment was used to replace the 
corresponding fragment of the endogenous human 5-HT2A to generate mutant 5-HT2A 
with 5-HT2C: C310K/IC3. The Apa I-Xba fragment of AP3 was used to replace the 
corresponding fragment in mutant 5-HT2A with 5-HT2C: C310K/IC3 to generate AP4. 

20 Example 2 

Receptor Expression 


A. pCMV 

Although a variety of expression vectors are available to those in the art, for 
25 purposes of utilization for both the endogenous and non-endogenous receptors discussed 
herein, it is most preferred that the vector utilized be pCMV. This vector was deposited 
with the American Type Culture Collection (ATCC) on October 13, 1998 (10801 University 
Blvd.. Manassas, VA 201 10-2209 USA) under the provisions of the Budapest Treaty for the 
International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
30 Procedure. The DNA was tested by the ATCC and determined to be viable. The ATCC has 
assigned the following deposit number to pCMV: ATCC #203351 . See Figure 8. 

B. Transfection procedure 
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12-well plates). Cells were permitted to adhere to the wells, followed by one wash with 
lxPBS. Thereafter, 0.5 pCi 3 H-inositol in 1ml inositol-free DMEM was added per well. 
Protocol B. 

On day 4, the cells were washed with 0.5 ml PBS and then 0.45 ml of assay medium 
5 was added containing inositol-free/serum free media, 10 pM pargyline, 10 mM lithium 
chloride, or 0.4 ml of assay medium and 50 ul of lOx ketanserin (ket) to a final 
concentration of lOpM. The cells were then incubated for 30 minutes at 37°C. Then the 
cells were washed with 0.5 ml PBS and 200 ul of fresh/icecold stop solution (1M KOH; 18 
mM Na-borate; 3.8 mM EDTA) was added/well. The solution was kept on ice for 5-10 
10 minutes or until the cells were lysed and then neutralized by 200 pi of fresh/ice cold 
neutralization sol. (7.5 % HCL). The lysate was then transferred into 1.5 ml micro- 
centrifuge tubes and 1 ml of chloroform/methanol (1:2) was added/tube. The solution was 
vortexed for 15 seconds and the upper phase was applied to a Biorad AG1-X8 anion 
exchange resin ( 100-200 mesh). The resin was washed with water and 0.9 ml of the upper 
15 phase was loaded onto the column. The column was washed with 10 mis of 5 mM myo- 
inositol and 10 ml of 5 mM Na-borate/60mM Na-formate. The inositol trisphosphates were 
eluted into scintillation vials containing 10 ml of scintillation cocktail with 2 ml of 0.1 M 
formic acid/ 1 M ammonium formate. The columns were regenerated by washing with 10 
ml of 0.1 M formic acid/3M ammonium formate and rinsed twice with dd H 2 O and stored at 
20 room temperature in water. Results are discussed below. 

Figure 12 is an illustration of IP3 production from the human 5-HT2A receptor 
which was mutated using the same point mutation as set forth in Casey, which rendered the 
rat receptor constitutively active. The results represented in Figure 12, support the position 
that when the point mutation shown to activate the rat receptor is introduced into the human 
25 receptor, little activation of the receptor is obtained that would allow for appropriate 
screening of candidate compounds, with the response being only moderately above that of 
the endogenous human 5-HT2A receptor. Generally, a response of at least 2X above that of 
the endogenous response is preferred. 

Figure 13 provides an illustration comparing IP3 production from endogenous 5- 
30 HT2A receptor and the AP4 mutation. The results illustrated in Figure 13 support the 
position that when the novel mutation disclosed herein is utilized, a robust response of 
constitutive IP3 accumulation is obtained (e.g., over 2X that of the endogenous receptor). 
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Figure 14 provides an illustration of IP3 production from AP3. The results 
illustrated in Figure 14 support the position that when the novel mutation disclosed herein is 
utilized, a robust response of constitutive IP3 accumulation is obtained. 

Figure 1 5 provides bar-graph comparisons of IP3 accumulation between endogenous 
5 human 5-HT2C receptor and AP-1. Note that the endogenous receptor has a high degree of 
natural constitutive activity relative to the control CMV transfected cells (i.e., the endogenous 
receptor appears to be constitutively activated). 

Example 6 

i o Screening of Compounds Known to have 5-HT2C Antagonist Activity Against 

Non-Endogenous, Constitutively Activated 
Human Serotonin Receptor: AP-1 

A final concentration of 12.5 pg membranes prepared from COS7 cells (see 
15 Example 2) transiently expressing constitutively active mutant human 5HT2C receptor AP- 
1 were incubated with binding buffer (20 mM HEPES, pH 7.4, 100 mM NaCl, 20 mM 
MgCl2.6H20, 0.2% saponin, and 0.2 mM ascobate), GDP(lpM) and compound in a 96-well 
plate format for a period of 60 minutes at ambient room temperature. Plates were then 
centrifuged at 4,000 rpm for 15 minutes followed by aspiration of the reaction mixture and 
20 counting for 1 minute in a Wallac™ MicroBeta plate scintillation counter. A series of 
compounds known to possess reported 5HT2C antagonist activity were determined to be 
active in the [ 35 S]GTPyS binding assay using AP-1. IC 50 determinations were made for 
these commercially available compounds (RBI, Natick, MA). Results are summarized in 
Table 3. For each determination, eight concentrations of test compounds were tested in 
25 triplicate. The negative control in these experiments consisted of AP-1 receptor without test 
compound addition, and the positive control consisted of 12.5 pg/well of COS7 cell 
membranes expressing the CMV promoter without expressed AP-1 receptor. 


TABLE 3 

Test Compound 

Known Pharmacology 

IC 50 (nM) in GTP-y-[ 3 s S] 



Assay 

Metergoline 

5HT2/1C antagonist 

32.0 
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For the generic assay ([ 35 S]GTPyS; Example 3) and the antagonist binding assay 
(mesulergine; Example 4), transfection of COS-7 or 293T cells was accomplished using the 
following protocol. 

On day one, 5X10 6 COS-7 cells or 1X10 7 293T cells per 150mm plate were plated out. 

5 On day two, two reaction tubes were prepared (the proportions to follow for each tube are per 
plate): tube A was prepared by mixing 20pg DNA (e.g., pCMV vector; pCMV vector AP-1 
cDNA, etc.) in 1.2ml serum free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared 
by mixing 120pl lipofectamine (Gibco BRL) in 1.2ml serum free DMEM. Tubes A and B 
were then admixed by inversions (several times), followed by incubation at room temperature 
10 for 30-45min. The admixture is referred to as the “transfection mixture”. Plated COS-7 cells 
were washed with IX PBS, followed by addition of 10ml serum free DMEM. 2.4ml of the 
transfection mixture was then added to the cells, followed by incubation for 4hrs at 37°C/5% 
C0 2 . The transfection mixture was then removed by aspiration, followed by the addition of 
25ml of DMEM/10% Fetal Bovine Serum. Cells were then incubated at 37°C/5% C0 2 . After 
1 5 72hr incubation, cells were then harvested and utilized for analysis. 

Example 3 

GTP Membrane Binding Scintillation Proximity Assay 

20 The advantages of using [ 35 S]GTPyS binding to measure constitutive activation are 

that: (a) [ 35 S]GTPyS binding is generically applicable to all G protein-coupled receptors; and 
(b) [ 35 S]GTPyS binding is proximal at the membrane surface, thereby making it less likely to 
pick-up molecules which affect the intracellular cascade. The assay utilizes the ability of G 
protein-coupled receptors to stimulate [ 35 S]GTPyS binding to membranes expressing the 
25 relevant receptors. Therefore, the assay may be used to directly screen compounds at the 
disclosed serotonin receptors. 

Figure 9 demonstrates the utility of a scintillation proximity assay to monitor the 
binding of [ 35 S]GTPyS to membranes expressing the endogenous human 5-HT2C receptor 
expressed in COS cells. In brief, the assay was incubated in 20 mM HEPES, pH 7.4, binding 
30 buffer with 0.3 nM [ 35 S]GTPyS and 12.5 pg membrane protein and 1 pM GDP for 30 minutes. 
Wheatgerm agglutinin beads (25 pi; Amersham) were then added and the mixture was 
incubated for another 30 minutes at room temperature. The tubes were then centrifuged at 1500 
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x g for 5 minutes at room temperature and then counted in a scintillation counter. As shown in 
Figure 9, serotonin, which as the endogenous ligand activates the 5-HT2C receptor, stimulated 
[ 35 S]GTPyS binding to the membranes in a concentration dependant manner. The stimulated 
binding was completely inhibited by 30 pM mianserin, a compound considered as a classical 
5 5-HT2C antagonist, but also known as a 5-HT2C inverse agonist. 

Although this assay measures agonist-induced binding of [ 35 S]GTPyS to membranes 
and can be routinely used to measure constitutive activity of receptors, the present cost of 
wheatgerm agglutinin beads may be prohibitive. A less costly but equally applicable 
alternative also meets the needs of large-scale screening. Flash plates and Wallac™ scintistrips 
] o may be used to format a high throughput [ 3s S]GTPyS binding assay. This technique allows 
one to monitor the tritiated ligand binding to the receptor while simultaneously monitoring the 
efficacy via [ 35 S]GTPyS binding. This is possible because the Wallac™ beta counter can 
switch energy windows to analyze both tritium and 35 S-labeled probes. 

Also, this assay may be used for detecting of other types of membrane activation 
1 5 events that result in receptor activation. For example, the assay may be used to monitor 32 P 
phosphorylation of a variety of receptors (including G protein-coupled and tyrosine kinase 
receptors). When the membranes are centrifuged to the bottom of the well, the bound 
[ 35 S]GTPyS or the 32 P-phosphorylated receptor will activate the scintillant coated on the wells. 
Use of Scinti® strips (Wallac™) demonstrate this principle. Additionally, this assay may be 
20 used for measuring ligand binding to receptors using radiolabeled ligands. In a similar manner, 
the radiolabeled bound ligand is centrifuged to the bottom of the well and activates the 
scintillant. The [ 35 S]GTPyS assay results parallel the results obtained in traditional second 
messenger assays of receptors. 

As shown in Figure 10, serotonin stimulates the binding of [ 35 S]GTPyS to the 
25 endogenous human 5-HT2C receptor, while mianserin inhibits this response. Furthermore, 
mianserin acts as a partial inverse agonist by inhibiting the basal constitutive binding of 
[ 35 S]GTPyS to membranes expressing the endogenous human 5-HT2C receptor. As expected, 
there is no agonist response in the absence of GDP since there is no GDP present to exchange 
for [ 35 S]GTPyS . Not only does this assay system demonstrate 
30 the response of the native 5-HT2C receptor, but it also measures the constitutive activation of 
other receptors. 
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Figure 11A and Figure 1 IB demonstrate the enhanced binding of [ 35 S]GTPyS to 
membranes prepared from 293T cells expressing the control vector alone, the native human 5- 
HT2C receptor or the AP-1 receptor. The total protein concentration used in the assay affects 
the total amount of [ 35 S]GTPyS binding for each receptor. The c.p.m. differential between the 
5 CMV transfected and the constitutively active mutant receptor increased from approximately 
1000 c.p.m at 10 pg/well to approximately 6-8000 c.p.m. at 75 pgAvell protein concentration, 
as shown in Figure 1 1 . 

The AP-1 receptor showed the highest level of constitutive activation followed by the 
wild type receptor, which also showed enhanced [ 35 S]GTPyS binding above basal. This is 
!0 consistent with the ability of the endogenous human 5-HT2C receptor to accumulate 
intracellular IP3 in the absence of 5HT stimulation (Example 5) and is also consistent with 
published data claiming that the endogenous human 5-HT2C receptor has a high natural basal 
activity. Therefore, the AP-1 receptor demonstrates that constitutive activity may be measured 
by proximal [ 35 S]GTPyS binding events at the membrane interface. 

15 Example 4 

SEROTONIN RECEPTOR AGONIST/ANTAGONIST COMPETITIVE BINDING ASSAY 

Membranes were prepared from transfected COS-7 cells ( see Example 2) by 
homogenization in 20 mM HEPES and 10 mM EDTA , pH 7.4 and centrifuged at 49,000 x 
g for 15 min. The pellet was resuspended in 20 mM HEPES and 0.1 mM EDTA, pH 7.4, 
20 homogenized for 10 sec. using polytron homogenizer (Brinkman) at 5000 rpm and 
centrifuged at 49,000 x g for 1 5 min. The final pellet was resuspended in 20 mM HEPES 
and 10 mM MgCh, pH 7.4, homogenized for 10 sec. using polytron homogenizer 
(Brinkman) at 5000 rpm. 

Assays were performed in triplicate 200pl volumes in 96 well plates. Assay buffer 
25 (20 mM HEPES and 10 mM MgCb, pH 7.4) was used to dilute membranes, 3 H-LSD, 3 H- 

mesulergine, serotonin (used to define non-specific for LSD binding) and mianserin (used to 
define non-specific for mesulergine binding). Final assay concentrations consisted of InM 
3 H-LSD or InM 3 H-mesulergine, 50pg membrane protein and 100pm serotonin or 
mianserin. LSD assays were incubated for 1 hr at 37° C, while mesulergine assays were 
30 incubated for 1 hr at room temperature. Assays were terminated by rapid filtration onto 
Wallac Filtermat Type B with ice cold binding buffer using Skatron cell harvester. The 
radioactivity was determined in a Wallac 1205 BetaPlate counter. 
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Example 5 

Intracellular IP3 accumulation Assay 

5 For the IP3 accumulation assay, a transfection protocol different from the protocol 

set forth in Example 2 was utilized. In the following example, the protocols used for days 
1-3 were slightly different for the data generated for Figures 12 and 14 and for Figures 13 
and 1 5; the protocol for day 4 was the same for all conditions. 

A. COS-7 and 293 Cells 

10 On day one, COS-7 cells or 293 cells were plated onto 24 well plates, usually lxlO 5 

cells/well or 2xl0 5 cells/well, respectively. On day two, the cells were transfected by first 
mixing 0.25 pg DNA (see Example 2) in 50 pi serum-free DMEM/well and then 2 pi 
lipofectamine in 50 pi serum-free DMEM/well. The solutions (“transfection media”) were 
gently mixed and incubated for 15-30 minutes at room temperature. The cells were washed 
15 with 0.5 ml PBS and then 400 pi of serum free media was mixed with the transfection 
media and added to the cells. The cells were then incubated for 3-4 hours at 37°C/5%C02. 
Then the transfection media was removed and replaced with 1 ml/well of regular growth 
media. On day 3, the media was removed and the cells were washed with 0.5 ml PBS. Then 
0.5 ml inositol-free/serum-free media ( GIBCO BRL) was added to each well with 0.25 pCi 
20 of 3 H-myo-inositol/well and the cells were incubated for 16-18 hours overnight at 
37°C/5%C0 2 . Protocol A. 

B. 293 Cells 

On day one, lxlO 7 293 cells per 150mm plate were plated out. On day two, two 
reaction tubes were prepared (the proportions to follow for each tube are per plate): tube A was 
25 prepared by mixing 20pg DNA (e.g., pCMV vector; pCMV vector AP-1 cDNA, etc.) in 1 .2ml 
serum free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared by mixing 120pl 
lipofectamine (Gibco BRL) in 1 ,2ml serum free DMEM. Tubes A and B were then admixed 
by inversions (several times), followed by incubation at room temperature for 30-45min. The 
admixture is referred to as the “transfection mixture”. Plated 293 cells were washed with 
30 1XPBS, followed by addition of 10ml serum free DMEM. 2.4ml of the transfection mixture 
was then added to the cells, followed by incubation for 4hrs at 37°C/5% CO 2 . On day 3, cells 
were trypsinized and counted, followed by plating of lxl 0 6 cells/well (poly D-lysine treated 
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Mesulergine 

5HT2/1C antagonist 

21.2 

Methysergide 

5HT2/1C antagonist 

6.1 

Methiothepin 

5HT1 antagonist 

20.4 

N ormethylclozapin 

5HT2/1C antagonist 

21.4 

Fluoxetine 

5HT reuptake inhibitor 

114.0 

Ritanserin 

5HT2/1C antagonist 

19.4 


The IC 50 results confirm that the seven tested compounds showed antagonist activity at the AP- 
1 receptor. 

Example 7 

5 SCREENING OF CANDIDATE COMPOUNDS AGAINST NON-ENDOGENOUS, 
CONSTITUTIVELY ACTIVATED HUMAN SEROTONIN RECEPTORS:AP-l 

Approximately 5,500 candidate compounds (Tripos, Inc., St. Louis, MO) were 
screened using the assay protocol of Example 3 (with AP-1 mutant receptor) for identification 
] 0 as inverse agonists against the receptor; for this assay, an arbitrary cut-off of at least 50% 
inhibition was established for identification of inverse agonists. Approximately 120 of these 
compounds evidenced at least 50% inhibition of [ 35 S]GTPyS binding at 10 pM candidate 
compound (data not shown). 

Example 8 

1 5 SCREENING OF SELECTED COMPOUNDS TO CONFIRM RECEPTOR 

BINDING: AP-1 

The candidate compounds identified from Example 7 were then screened using the 
assay protocol of Example 4 (mesulergine), using the AP-1 mutant receptor. ICso (nM) 
20 values were determined; five of the nearly 120 compounds of Example 7 were determined 
to have potent binding affinity for the receptor. Results are summarized in Table 4. 
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Table 4 


Candidate Compound 

IC 50 (nM) in Mesulergine 
Assay 

102461 

205.0 

102788 

46.5 

100341 

209.0 

100431 

147.0 

103487 

{ 

1,810.0 


Example 9a 

GENERAL SCREENING PARADIGM: 

5 SELECTION OF PRE-CLINICAL CANDIDATE LEADS 

The “primary” screen designed to directly identify human 5HT2 a/5HT2c receptor 
inverse agonists consisted of a membrane-based GTPyS binding assay utilizing membranes 
prepared from COS7 cells transiently transfected with AP-1 human receptor. Candidate 
10 compounds (lOpM final assay concentration) directly identified as inhibiting receptor- 
mediated increases in GTPyS binding by greater than 50-75% (arbitrary cut-off value) were 
considered active “hits”. Primary assay hits were then re-tested in the same assay to 
reconfirm their inverse agonist activity. If primary assay hits were reconfirmed active (50% 
or greater inhibition), and therefore directly identified as, e.g., an inverse agonist, one of 
15 two approaches were available: (a) so-called “directed libraries” could be created, i.e., 
additional candidate compounds were synthesized based upon the structures of the 
reconfirmed hits (geared towards, e.g., improvement in the characteristics of the 
compounds) whereby the directed library compounds were then evaluated for the ability to 
compete for radioligand binding to both mutant 5HT2C (AP-1) and endogenous 5HT2A 
20 receptors, or (b) the reconfirmed hits were then evaluated for the ability to compete for 
radioligand binding to both mutant 5HT2C (AP-1) and endogenous 5HT2A receptors. 
Thus, when approach (a) was used, because these directed library candidate compounds 
were based upon the structures of compounds that were directly identified from the 



WO 99/52927 


PCT/US99/08168 


27 

membrane-based GTPyS binding assay, the directed library compounds were not re-tested 
in the membrane-based GTPyS binding assay but rather were then confirmed via the 
radioligand binding analysis. The radioligand binding analysis tests were initially performed 
at 10|iM test compound in triplicate and if the compound inhibited radiolabeled binding by 
5 50% or more, the analysis was followed by eight concentration competition curves to 

determine Ki values. The last step in secondary assay evaluation was to determine if test 
compounds were capable of inhibiting AP-3 receptor-mediated accumulation of inositol 
phosphates (e.g., IP3). This final assay confirms that the directly identified compounds 
retained inverse agonist properties. 

10 Example 9b 

CONSTITUTIVELY ACTIVATED HUMAN 5HT2C RECEPTOR (AP-1) 
MEDIATED FACILITATION OF GTPyS BINDING TO COS7 MEMBRANES 

1 5 This protocol is substantially the same as set forth above in Example 6. 

Primary screening assays measuring GTPyS binding to membranes prepared from 
COS7 cells transiently transfected with human mutated 5HT2C receptor (AP-1) were used 
to directly identify inverse agonists in screening libraries (Tripos, Inc.). Candidate 
compound screens were performed in a total assay volume of 200pl using scintillant-coated 
20 Wallac Scintistrip™ plates. The primary assay was comprised of the following chemicals 
(at indicated final assay concentrations): 20 mM HEPES, pH 7.4, 100 mM NaCl, 20 mM 
MgCl2, 0.2% saponin, 0.2 mM ascorbic acid, lpM GDP, 0.3 nM GTPy 35 S, and 12.5 pg of 
the above defined membranes. Incubations were performed for 60 minutes at ambient room 
temperature. The binding assay incubation was terminated by centrifugation of assay plates 
25 at 4,000 rpm for 15 minutes, followed by rapid aspiration of the reaction mixture and 
counting in a Wallac MicroBeta™ scintillation counter. 

Primary screening of candidate compounds initially involved testing of 72 test 
compounds per assay plate (96-well plates were utilized), at a final assay concentration of 
lOpM candidate compound, in single replicates. A total of sixteen wells of each plate were 
30 dedicated for an eight concentration clozapine (a confirmed 5HT2C/2A inverse agonist) 
dose response curve (duplicate determinations at each concentration). Finally, a total of five 
assay wells of each plate were dedicated to define the negative control (AP-1 receptor 
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expressing membranes without addition of candidate compounds) and three wells from each 
plate to define the positive control (membranes without AP-1 receptor). 

Reconfirmation experiments involve re-testing candidate compounds in the same 
assay described above, except that candidate compounds were evaluated in triplicate, thus 
allowing evaluation of 24 compounds per 96-well assay plate. Similar to the primary assay 
plates, an eight concentration clozapine dose response curve (duplicate determinations at 
each concentration) and the same negative and positive control wells were also included 
within each 96-well plate. 


Example 9c(l) 

COMPETITION STUDIES 
MUTATED HUMAN 5HT2C RECEPTOR (AP-1) 

Radioligand binding competition experiments were performed in a total assay volume 
of 200pl using standard 96-well microtiter plates. The final assay ingredients consisted of 
assay buffer (20mM HEPES and lOmM MgCl 2 ), InM [ 3 H]mesulergine, and 50pg of 
membranes (COS7 with AP-1 as defined above). Nonspecific [ 3 H]mesulergine binding was 
defined in the presence of lOO^M mianserin. Incubations were performed for 1 hour at 37°C. 
Receptor bound radioligand was resolved from free radioligand by rapid filtration of the assay 
mixture over a Wallac Filtermat™ Type B filter, followed by washing with ice-cold assay 
buffer using a Skatron™ cell harvester. Radioactivity was counted using a Wallac 1205 
BetaPlate™ counter. Each assay plate contained five negative control wells (membranes 
expressing receptor and no candidate compound addition) and three positive control wells 
(each containing lOOpM mianserin). For one concentration tests, candidate compounds were 
diluted into assay buffer and screened at a final concentration of 10pM, in triplicate. For IC 50 
determinations, candidate compounds were diluted in assay buffer and eight different 
concentrations were evaluated, in triplicate. A total of 16 wells were designated for an eight 
concentration mianserin dose response curve evaluation for both assays. 

Example 9c(2) 


COMPETITION STUDIES 
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WILD TYPE HUMAN 5HT2A RECEPTOR 

Radioligand binding competition experiments were performed in a total assay volume 
of 200|il using standard 96-well microtiter plates. The final assay ingredients comprised assay 
5 buffer (20mM HEPES and lOmM MgCl 2 ), InM [ 3 H]LSD, and 50pg of the above-defined 
membranes (COS7 with AP-1). Nonspecific [ 3 H]LSD binding was defined in the presence of 
lOOpM serotonin. Incubations were performed for 1 hour at 37°C. Receptor bound 
radioligand was resolved from free radioligand by rapid filtration of the assay mixture over a 
Wallac Filtermat™ Type B filter, followed by washing with ice-cold assay buffer using a 
10 Skatron™ cell harvester. Radioactivity was counted using a Wallac 1205 BetaPlate™ counter. 
Each assay plate contained five negative control wells (membranes expressing receptor and no 
candidate compound addition) and three positive control wells (containing lOOpM mianserin). 
For one concentration tests, candidate compounds were diluted into assay buffer and screened 
at a final concentration of lOpM in triplicate. For IC50 determinations, candidate compounds 
15 were diluted in assay buffer and eight different concentrations were evaluated in triplicate. A 
total of 16 wells were designated for tin eight concentration serotonin dose response curve 
evaluation for both assays. 

Example 9d 

20 RECEPTOR-MEDIATED INOSITOL PHOSPHATE ACCUMULATION 

Candidate compound identified in the assays of Examples 9a-9c were then evaluated 
for inositol phosphate accumulation, following the protocol of Example 5 (COS7 cells 
expressing human mutated 5HT2A receptor, AP-3), modified as follows: tube A was 
25 prepared by mixing 1 6 pg DNA (e.g., pCMV vector; pCMV vector AP-1 cDNA, etc.) in 1 .0ml 
serum free DMEM (Irvine Scientific, Irvine, CA); tube B was prepared by mixing 60pl 
lipofectamine (Gibco BRL) in 1.0 ml serum free DMEM. Tubes A and B were then admixed 
by inversions (several times), followed by incubation at room temperature for 30 min. The 
admixture is referred to as the “transfection mixture”. Plated 293 cells were washed with 10 
30 ml Serum Free DMEM, followed by addition of 1 1 ml Serum Free DMEM. 2.0 ml of the 
transfection mixture was then added to the cells, followed by incubation for 5hrs at 37°C/5% 
C0 2 . On day 3, cells were trypsinized and counted, followed by plating of lxlO 6 cells/well 
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(12-well plates). Cells were permitted to adhere to the wells for 8 hrs., followed by one wash 
with lx PBS. Thereafter, 0.5 pCi 3 H-inositol in 1 ml inositol-free DMEM was added per well. 

On day 4, the cells were washed with 1.5 ml PBS and then 0.9 ml of assay medium 
was added containing inositol-ffee/serum free media, 10 pM pargyline, 10 mM lithium 
chloride, for 5 min in 37°C/5% C0 2 followed by 100 pi addition of candidate compound 
diluted in the same material. The cells were then incubated for 120 minutes at 37°C. Then 
the cells were washed with 1.5 ml PBS and 200 pi of fresh/icecold stop solution (1M KOH; 
18 mM Na-borate; 3.8 mM EDTA) was added/well. The solution was kept on ice for 5-10 
minutes or until the cells were lysed and then neutralized by 200 pi of fresh/ice cold 
neutralization sol. (7.5 % HCL). The lysate was then transferred into 1.5 ml micro- 
centrifuge tubes and 1 ml of chloroform/methanol (1:2) was added/tube. The solution was 
vortexed for 15 seconds and the upper phase was applied to a Biorad AG1-X8 anion 
exchange resin ( 100-200 mesh). The resin was washed with water and 0.9 ml of the upper 
phase was loaded onto the column. The column was washed with 10 mis of 5 mM myo- 
inositol and 10 ml of 5 mM Na-borate/60mM Na-formate. The inositol trisphosphates were 
eluted into scintillation vials containing 10 ml of scintillation cocktail with 2 ml of 0.1 M 
formic acid/ 1 M ammonium formate. The columns were regenerated by washing with 10 
ml of 0.1 M formic acid/3M ammonium formate and rinsed twice with dd H 2 0 and stored at 
room temperature in water. 

Following this round of assaying, candidate compounds having an IC 5 o value of less 
than lOpM were considered as potential leads for the development of pharmaceutical 
compositions. 

SCREENING CANDIDATE COMPOUNDS 
Following the protocols set forth above, one compound, 103487 (Example 8, supra ) 
evidenced the following results: 


Figure 

GTPyS 

GTPyS 

Competitive 

Competitive 

Inositol 

Number 

AP-1 

AP-1 

Binding 

Binding 

Phosphate 


Percent 

Percent 

AP-1 

WT5HT2A 

Accumulation 


Inhibition 

Inhibition 



AP-3 


Relative 

Relative To 

([ 3 H]mesulergine) 

([ 3 H]LSD) 
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To 

Positive 

Control 

(Primary) 

Positive 

Control 

(Reconfirm) 

IC 50 Value 
(nM) 

IC 50 Value 
(nM) 

IC 50 Value 
(nM) 

15A 

(103487) 

-1% 

31% 

2100 

850 

46 

52 

90 


Based upon these results, structure activity analysis of the 103487 compound 
suggested that a series of derivatives of 3-(4-bromo-l-methylpyrazole-3-yl)phenylamine 
would exhibit similar 5-HT 2A activity and selectivity. A series of derivatives of 3-(4- 
bromo- 1 -methylpyrazole-3-y l)phenylamine have now been synthesized. These “directed” 
library compounds (Tripos, Inc.) were then analyzed in accordance with the protocols of 
Examples 9c(l), 9c(2) and 9d. 

This series of compounds exhibits highly selective 5-HT 2A activity. Accordingly, in 
the first aspect of the invention, a series of compounds possessing 5-HT 2A receptor activity 
that are useful as inverse agonists at such receptors is designated by the general formula 
(A): 



Wherein: 

W is lower alkyl (Ci- 6 ), or halogen; 
V is lower alkyl (Ci^), dr halogen; 
X is either Oxygen or Sulfur; 
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Y is NR 2 R 3 , or (CH 2 ) m R 4 , or 0(CH 2 ) n R 4 ; 

Z is lower alkyl (Cm); 
m = 0 - 4 
n = 0 - 4 

5 R 1 is H or lower alkyl (Cm); 

R 2 is H or lower alkyl(CM); 

R 3 and R 4 are independently a Ci . 6 alkyl, or C 2 -6 alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF3, CCI3, Me, N 0 2 , OH, 

10 OMe, OEt, CONR 5 R 6 , NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , SO 3 R 7 , COMe, 

COEt, CO-lower alkyl, SCF 3 CN, C 2 . 6 alkenyl, H, halogens, Cm alkoxy, C 3-6 
cycloalkyl, Cm alkyl, aryl, and aryloxy wherein each of the C 3 . 6 cycloalkyl, Cm 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, 

15 OMe, OEt, CONR 5 R 6 , NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 , SO 3 R 7 , 

S0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, H, halogens, Q. 
4 alkoxy, C 3.6 cycloalkyl. Cm alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or C 2 . 6 alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
20 substituted by up to four substituents in any position independently selected from 

CF 3 , CCI 3 , Me, N0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH 3 , OCF 3 , SMe, 
COOR 9 , SO 3 R 7 , S0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, 
H, halogens. Cm alkoxy, C 3 ^ cycloalkyl, Ci . 6 alkyl, and aryl wherein each of the 
C3-6 cycloalkyl. Cm alkyl, or.aryl groups may be further optionally substituted by 
25 up to four substituents in any position independently selected from CF 3 , CCI 3 , Me, 

N0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH 3 , OCF 3 , SMe, COOR 7 , 
S0 2 NR g R 9 , SO 3 R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, H, 
halogens. Cm alkoxy, C 3 ^ cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5 , 6 or 7 membered cyclic structure which 
30 may be either saturated or unsaturated and that may contain up to four heteroatoms 

selected from O, N or S and said cyclic structure may be optionally substituted by up 
to four substituents in any position independently selected from CF3, CCI3, Me, 
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N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 , S0 3 R 7 , NHCOCH 3 , COEt. 
COMe, or halogen; 

R 7 may be independently selected from H or C|_6 alkyl; 

R 8 and R 9 are independently a H, or Ci-6 alkyl, or C 2 ^ alkenyl, or 
5 cycloalkyl, or aryl, or CH 2 aryi group and each said group may be optionally 

substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2) OH, OMe, SMe, COMe, CN, COOR 7 , 
S0 3 R 7 , COEt, NHCOCHj, or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
10 to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle; 

Ci-6 alkyl moieties can be straight chain or branched; 

optionally substituted Ci-6 alkyl moieties can be straight chain or branched; 

C 2 _6 alkenyl moieties can be straight chain or branched; and 
15 optionally substituted C 2 .6 alkenyl moieties can be straight chain or 

branched. 

Examples of suitable C|^ alkyl groups include but art not limited to methyl, 
ethyl, n-propyl, i-propyl, n-butyl, and t-butyl. 

Halogens are typically F, Cl, Br, and I. 

20 Examples of 5 or 6 membered ring moieties include, but are not restricted 

to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl, triazolyl, 
pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include, 
but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl, 
isoquinolyli indolyl, quinoxalinyl, quinazolinyl and benzothienyl. 

25 A more preferred series of compounds possessing 5 -HT 2 a receptor activity that are 

useful as inverse agonists at such receptors is designated by the general formula (B): 
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Wherein: 

W is Me, or Et, or halogen; 

X is either Oxygen or Sulfur; 

5 Y is NR 2 R 3 , or (CH 2 ) m R 4 , or 0(CH 2 )„R 4 ; 

Z is lower alkyl (Cm); 
m = 0 - 4 
n = 0 - 4 

R 1 is H or lower alkyl (Cm); 

10 R 2 is H or lower alkyl(CM); 

R 3 and R 4 are independently a Cm alkyl, or Cm alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, 
OMe, OEt, CONR s R 6 , NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe, 

15 COEt, CO-lower alkyl, SCF 3 CN, Cm alkenyl, H, halogens, Cm alkoxy, Cm 

cycloalkyl, Cm alkyl, aryl, and aryloxy wherein each of the Cm cycloalkyl, Cm 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, 
OMe, OEt, CONR 5 R 6 , NR s R 6 , NHCOCHj, OCF3, SMe, COOR 7 , S0 3 R 7 , 

20 S0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, H, halogens, C|. 

4 alkoxy, Cm cycloalkyl, Cm alkyl, and aryl; 
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R 5 and R 6 are independently a H, or Cm alkyl, or C2-6 alkenyl, or 
cycloalkyl, or aryl, or CH2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF 3 , CCIj, Me, N 0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH3, OCF 3 , SMe, 

5 COOR 9 , SO3R 7 , S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl. SCF 3 , CN, C 2 . 6 alkenyl, 

H, halogens. Cm alkoxy, C3-6 cycloalkyl, Ci_ 6 alkyl, and aryl wherein each of the 
C3.6 cycloalkyl, C1-6 alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCI3, Me, 
N 0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH3, OCF3, SMe, COOR 7 , 

10 S 0 2 NR 8 R 9 , SO3R 7 , COMe, COEt, CO-lower alkyl, SCF3, CN, C 2 . 6 alkenyl, H, 

halogens. Cm alkoxy, C 3 ^ cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5 , 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
15 to four substituents in any position independently selected from CF3, CCI3, Me, 

N 0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S 0 2 NR 8 R 9 , SO3R 7 , NHCOCH3, COEt, 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

R 8 and R 9 are independently a H, or Cm alkyl, or C 2 -6 alkenyl, or 
20 cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 

substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF 3 , OEt, CC 1 3 , Me, N 0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
SO3R 7 , COEt, NHCOCH3, or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
25 to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle; 

Cm alkyl moieties can be straight chain or branched; 

optionally substituted Cm alkyl moieties can be straight chain or branched; 

C 2 -6 alkenyl moieties can be straight chain or branched; and 
30 optionally substituted C2-6 alkenyl moieties can be straight chain or 

branched. 

Examples of suitable Cm alkyl groups include but art not limited to methyl, 
ethyl, n-propyl, i-propyl, n-butyl, and t-butyl. 
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Halogens are typically F, Cl, Br, and I. 

Examples of 5 or 6 membered ring moieties include, but are not restricted 
to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl triazolyl, 
pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include, 
5 but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl, 
isoquinolyl, indolyl, quinoxalinyl, quinazolinyl and benzothienyl. 


A first series of compounds having 5-HT2A receptor activity is represented by 
class (I) of compounds of formula (B) wherein Y=NR 2 R 3 : 


R 


R 



Wherein: 

Preferably R 1 and R 2 are H. 

Preferably W is Br. 

Preferably X is O. 

15 Preferably Z is Me. 

Preferably R3 is 4-trifluoromethoxyphenyl or 4-trifluoromethoxybenzyl. 
Preferred compounds are: 
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103487 

N.[3^4-bromo-l-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl} amino] carboxamide 



CF 


3 


5 


116115 

N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl][ {(4-trifluoromethoxy)phenyl)methy 1 } amino]carboxamide 



10 These two compounds demonstrated the following activities using the assay 

protocols defined in the Examples above: 
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Competitive 

Competitive 

Inositol Phosphate 


Binding 

Binding 

Accumulation 

Compound Number 

AP-1 

([ 3 H]mesulergine) 

WT 5HT 2A 
([ 3 H]LSD) 

AP-3 


IC 50 Value 

IC 50 Value 

IC 50 Value 


(mM) 

(UM) 

(UM) 

103487 

2.1 

.046 

.052 

116115 

1.2 

.45 

.0171 


Additional compounds of formula (B) wherein Y=NR 2 R 3 are set forth below. 
Inositol phosphate accumulation assays evidence the activity of test compounds. Both single 
concentration percentages of control values and IC50 determinations indicate activity. In the 
5 tables below the column legends have the following meanings: 

IPt % Contol : The values in this column reflect an IP Accumulation Assay where the 
test compounds were evaluated at one concentration of 10 pM. For these assays, the compound 
was diluted into inositol-free Dulbecco's Eagle Media containing 10 pM pargyline and 10 mM 
LiCl and tested at a final assay concentration of 10 pM, in triplicate. The percent control value 
10 was calculated based on the control in which no test compound was added. 

IP-i AP-3 ICsn nM : The values in this column reflect an IP accumulation assay in which 
the test compound was evaluated at several different concentrations whereby an IC50 could be 
determined. This column corresponds to the column appearing in the tables above which is 
labeled: Inositol Phosphate Accumulation, AP-3, IC50 Value (pM). 

1 5 WT 5HTta LSD ICUn nM : The values in this column reflect a competitive binding 

assay using LSD. This column corresponds to the column appearing in the tables above 
which is labeled: Competitive Binding, WT 5HT2A, ([ 3 H]LSD), IC50 Value (pM). 

Compounds listed in each of the following tables reference the structures 
immediately preceding the table. A “dash” in the table indicates that no value was 
20 determined. 
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R 3 




Compound 

No. 

R 1 

R 2 

R 3 

R 4 

X 

u 


IP 3 
AP -3 
IC50 nM 

WT 
5 HT 2a 
LSD 
IC50 nM 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][(4-methylthiophenyl)amino]carboxamide 

116079 

sch 3 

H 

H 

H 

0 

NH 

16 

17 

4 

N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl][ (4-chlorophenyl)amino]carboxamide 

116081 

Cl 

H 

H 

H 

0 

NH 

10 

3.2 

11 

{[3-(4-broino-l-methylpyrazol-3-yl)phenyI]amino}-N-(4-fluorophenyl)carboxamide 

116082 

F 

H 

H 

H 

0 

NH 

11 

- 

7 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[2-(trifluoromethoxy)phenyl]carboxarnide 

116087 

H 



H 

0 

NH 

11 

- 

200 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]am!no}-N-(2-nitrophenyl)carboxamide 

116089 

H 

H 

no 2 

H 

0 

NH 

27 

- 

238 
























































{[3-(4-bromo-l-methylpyrazol-3-yl)phenyrjamino}-N-(4-methoxyphenyI)carboxamide 


116091 MeO H H H O NH 12 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-methylphenyl)carboxamide 


116092 H H Me H O NH 32 - 131 


{[3-(4-bromo-l-methylpyrazol-3-yI)phenyl]amino}-N-[4-(trifluorornethyl)phenyl]carboxamide 


116097 CF 3 H H H O NH 11 - 65 


{[3.(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3-chlorophenyl)carboxamide 


116105 H Cl H H O NH 11 - 39 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-chlorophenyl)carboxamide 


116108 H H Cl H O NH 


{[3-(4-bromo-l-methylpyrazoI-3-yl)phenyl]amino}-N-[4-(methylethyl)phenyl]carboxamide 


116110 isopropyl H H H O NH 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]ainino}-N-(3-methoxyphenyl)carboxamide 


116111 H IMeO H H O NH 7 - 106 


{[3-{4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3-methylphenyl)carboxainide 


116112 H Me H HO NH 14 


[{3-(4.bromo-l-methylpyrazol-3-yl)phenyl}amino]-N-methyl-N-[4-(trifluoromethoxy)phenyl]carboxainide 


116113 CF 3 0 H H H O NCH 3 - 193 2 


N-[4-(tert-butyl)phenyl]{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino} carboxamide 


116119 t-butyl I H H H O NH 17 - 476 


N-[4-(dimethylamino)phenyl]{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}carboxamide 


309 


NMe2 

H 

H 

H 

O 

NH 

9 


116122 
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-(3,5-dichloro-4-methylphenyl){[3-{4-bromo-l-methylpyrazol-3-yl)phenyI]amino} carboxamide 


116138 Me Cl H Cl 0 NH 23 - 122 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[4- 

(trifluoromethylthio)phenyl]carboxamide 


116139 CF 3 S H H H O NH 12 - 56 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-fluorophenyl)carboxamide 


116144 1 H | H F H O NH 12 - 37 

2-({[3-(4-bromo-l-methylpyrazol-3-yI)phenyl]amino}carbonyiamino)benzamide 


116145 I H H |CONH 2 H O NH 31 - 7473 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(4-cyanophenyl)carboxamide 


116147 CN H H H 0 NH 12 - 2 


{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-cyanophenyl)carboxamide 


116148 H H CN H O NH 30 












Compound 

No. 


116141 


N-[3-(4-bromo-l-methylpyrazol-3- 

yl)phenyl][cyclohexylamino]carboxamide 


IP3 
AP-3 
IC50 nM 

WT 
5HT 2a 
LSD 
ICso nM 

114 

81 


Compound 

No. 


R 1 

R 2 

R 3 

R 4 

R 5 

IP 3 

AP-3 






ICso nM 


IC 50 nM 


N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][phenylmethylamino]carboxamide 


N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][{(4-fluorophenyl)methyl}amino]carboxamide 


116182 


H 

H 

89 

132 
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116183 OMe OMe H 


N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][{(3,4,5- 

trimethoxyphenyl)methyl}amino]carboxamide 


116184 I OMe I OMe 


N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][{(2-methylphenyl)methyl}amino]carboxamide 


116185 H 


N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl][ {(4-methoxyphenyl)methyl}amino]carboxamide 


102 


116189 

OMe | 

H 

H 

H ! 

1 

H 


Compound 

No. 


R 1 

R 2 

R 3 

R 4 

R 5 

IP 3 

AP-3 






ICso nM 


WT 


ICso nM 


N-[3-(4-bromo- 1 -methyipyrazol-3-yl)phenyl][ {2-(4-methoxyphenyl)ethyl } amino]carboxamide 





































WO 99/52927 


PCT/US99/08168 


44 


116194 

OMe 

H 

H 

H 

H 

32 

61 


5 A second series of compounds having 5 -HTza receptor activity is represented by a 

class (II) of compounds of formula (B) wherein Y= 0(CH2) n R4: 



(D) 

Wherein: 

10 Preferably R 1 is H. 

Preferably W is Br. 

Preferably X is O. 

Preferably Z is Me. 

Preferably when n = 0, R 4 is 4-methoxyphenyl or tertiary butyl. 
1 5 Preferred compounds are: 


116100 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][4-methoxyphenoxylcarboxamide 
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116192 

(tert-butoxy)-N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]carboxamide 



These two compounds demonstrated the following activity: 



Competitive 

Competitive 

Inositol Phosphate 


Binding 

Binding 

Accumulation 


AP-1 

WT 5HT 2a 

AP-3 

Compound No. 

([ 3 H]mesulergine) 

(t 3 H]LSD) 



IC 50 Value 

IC 50 Value 

IC 50 Value 


(pM) 

(pM) 

(pM) 

116100 

1.8 

<0.001 

0.0003 
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116192 ! 

. 

- 

0.014 

0.057 


In addition to the assays discussed above, the specific activity of 1 16100 at the 
5HT2A receptor was further confirmed by the following. 

Tn Vitro Binding of 5 HT 7 a Receptor 

5 Animals: 

Animals (Sprague-Dawley rats) were sacrificed and brains were rapidly dissected 
and frozen in isopentane maintained at ~42°C. Horizontal sections were prepared on a 
cryostat and maintained at -20°C. 

LSD Displacement Protocol: 

10 Lysergic acid diethylamide (LSD) is a potent 5HT2A receptor and dopamine D2 

receptor ligand. An indication of the selectivity of compounds for either or both of these 
receptors involves displacement of radiolabeled-bound LSD from pre-treated brain sections. 
For these studies, radiolabeled I 125 -LSD (NEN Life Sciences, Boston, MA., Catalogue 
number NEX-199) was utilized; spiperone (RBI, Natick, MA. Catalogue number s-128), a 
15 5HT2A receptor and dopamine D2 receptor antagonist, was also utilized. Buffer consisted 
of 50 nanomolar TRIS-HC1, pH 7.4 

Brain sections were incubated in (a) Buffer plus 1 nanomolar I 1J5 -LSD; (b) Buffer 
plus 1 nanomolar I I25 -LSD and 1 micromolar spiperone; or Buffer plus 1 nanomolar I 125 - 
LSD and 1 micromolar 116100 for 30 minutes at room temperature. Sections were then 
20 washed 2X 10 minutes at 4°C in Buffer, followed by 20 seconds in distilled H 2 O. Slides 
were then air-dried. 

After drying, sections were apposed to x-ray film (Kodak Hyperfilm) and exposed 
for 4 days. 

Analysis: 

25 Figures 16A-C provide representative autoradiographic sections from this study. 

Figure 16A evidences darker bands (derived from I 125 -LSD binding) primarily in both the 
fourth layer of the cerebral cortex (primarily 5HT2A receptors), and the caudate nucleus 
(primarily dopamine D2 receptors and some 5HT2A receptors). As can be seen from Figure 
16B, spiperone, which is a 5HT2A and dopamine D2 antagonist, displaces the I I25 -LSD from 
30 these receptors on both the cortex and the caudate. As can be further seen from Figure 16C, 
116100 appears to selectively displace the I ,25 -LSD from the cortex (5HT 2A ) and not the 
caudate (dopamine D2). 
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A third series of compounds having 5 -HT 2 a receptor activity is represented by a 
class (III) of compounds of formula (B) wherein Y= (CH 2 ) m R 4 : 



(III) 


5 Wherein: 

Preferably W is Br. 

Preferably X is O. 

Preferably Z is Me. 

Preferably R 1 is H. 

10 Preferably when m = 0, R 4 is preferably 4-trifluoromethoxyphenyl, or 

thiophene, or 4-chlorophenyl. 

Preferred compounds are: 

116101 

15 m = 0, R 1 = H, R 4 = 4-trifluoromethoxyphenyl 

N_[3-(4-bromo-l-methylpyrazol-3-yl)pheny!][4-trifluoromethoxyphenyl]carboxamide 
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H 



CF 


3 


116102 

5 m = 0, R 1 = H, R 4 = thiophene 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][2-thienyl]carboxamide 

H 



10 116120 

m = 0, R 1 = H, R 4 = 4-chlorophenyl 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][4-chlorophenyl]carboxamide 
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H 



These three compounds demonstrated the following activities: 


Compound Number 

Competitive 

Binding 

AP-1 

([ 3 H]mesulergine) 
IC 50 Value 
(uM) 

Competitive 

Binding 

WT5HT 2A 

([ 3 H]LSD) 
IC 50 Value 

(HM) 

Inositol Phosphate 
Accumulation 
AP-3 

IC 50 Value 
(uM) 

116101 

6.1 

.46 

0.0213 

116102 

2.8 

.17 

0.080 

116120 

1.2 

21 

0.0315 


5 In Vivo Analysis of Compound 116102 

In addition to the in vitro assays shown in the above table, the in vivo response of 
animals to the 116102 compound is demonstrated by the following. 

A 5 HT 2 A receptor antagonist or inverse agonist is expected to decrease 
amphetamine-stimulated locomotion without affecting baseline locomotion. See, for 
10 example , Soresnon, et al, 266(2) J. Pharmacol. Exp. Ther. 684 (1993). Based upon the 

foregoing information, Compound 1 16102 is a potent inverse agonist at the human 5HT2A 
receptor. For the following study, the following parameters and protocol were utilized: 
Animals. Vehicle 

Adult male Sprague-Dawley rats were utilized for these studies. Animals were 
15 housed in groups of 2-3 in hanging plastic cages with food and water available at all times. 
Animals were weighed and handled for at least one day prior to surgery and throughout the 
studies. For these studies. Vehicle consisted of 90% ethanol (100%) and 10% water. 
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Amphetamine-stimulated locomotor activity: Assessment and Apparatus 

A San Diego Instruments Flex Field apparatus was used to quantify baseline and 
amphetamine-stimulated locomotor activity. This apparatus consists of four 16” x 16” clear 
plastic open fields. Photocell arrays (16 in each dimension) interfaced with a personal 
5 computer to automatically quantify activity. Several measures of activity can be assessed 
with the apparatus, including total photocell beam breaks. Animals (vehicle control and 
Compound treated) were injected s.c. 30 minutes prior to initiation of analysis. Following 
this 30 minute period, animals were placed individually into an open field and baseline 
activity was assessed for 30 minutes (habituation phase). Following baseline, animals were 
10 removed, injected with d-amphetamine sulfate (1.0 mg/kg) and immediately returned to the 
open field for 150 minutes, in order to follow the time course (10 minute intervals) of 
amphetamine-stimulated locomotor activity. 


Dosing 


Vehicle Control 

Compound 116102 

Dose (mg/kg) 

6 animals 

6 animals 

0.1 


6 animals 

1.0 


6 animals 

5.0 


6 animals 

10.0 


15 Analysis 

Results, based upon the number of recorded photobeam breaks (mean +/- sem), are 
presented in Figure 17A-C. As supported by Figures 17A,B and C, a general “inverted U” 
shaped pattern was observed (see, generally , Sahgal, A. “Practical behavioural 
neuroscience: problems, pitfalls and suggestions” pp 1-8, 5 in Behavioral Neuroscience: A 
20 Practical Approach. Volume 1 A. Sahgal (Ed.) 1993, IRL Press, New York). As Figure 17 
also indicates, with exception of the highest dose (lOmg/kg), in vivo, the tested doses of 
Compound 116102 evidenced a decrease in the amphetamine-stimulated locomotion, 
consistent with a 5HT2A receptor antagonist or inverse agonist. 
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Additional compounds of formula (B) wherein Y= (CHi) m R are set forth below. 


R 3 



Compound 

No. 

R 1 

R 2 

R 3 

R 4 

IPs 

IC50 bM 

LSD 
IC50 nM 

N-[3-(4-b 

iromo-l-methylpyrazol-3-yl)phenyl]-2-[4-(trifluoromethoxy)phenyl]acetamide 

116137 

ocf 3 

H 

H 

H 


1— 

N-| 

[3-<4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(3-fluorophenyl)acetamid( 
- — — — r 

3 

116174 

H 

F 

H 

H 

153 

318 

N-[: 

K4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(3-methoxyphenyl)acetamide 

116175 

H 

OMe 

H 

H 


625 

N- 

[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(2-fluorophenyl)acetamide 

116176 

H 

H 

F 

H 

129 

662 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-{4-nitrophenyl)acetamidi 

a 

116177 

no 2 

H 

H 

H 

61 

108 
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N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(2-methoxyphenyl)acetamide 


116178 

H 

H 

OMe 

H 

165 

2300 


compound names not provided 


Based upon the discovery of the specific inverse agonist activity of the above 
identified compounds at the 5HT2A receptor, a novel class of compounds has been identified 
5 which exhibits said activity. Accordingly, in the second aspect of the invention, there is 
provided a novel compound of formula (C): 



Wherein: 

W is Me, or Et, or halogen; 

10 X is either Oxygen or Sulfur; 

Y is NR 2 R\ or (CH 2 ) m R\ or 0(CH 2 ) n R 4 ; 

Z is lower alkyl (Ci^); 
m = 0 - 4; 
n =0-4; 

15 R 1 is H or lower alkyl (Ci. 4 )\ 

R 2 is H or lower alkyl(Ci- 4 ); 

R 3 is a Ci -6 alkyl, or C 2 -6 alkenyl, or cycloalkyl, or (CH 2 )karyl group (k = 1 - 
4), preferably k = 1, and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CF 3 , CCI 3 , Me, N0 2 , OH, 
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OMe, OEt, CONR 5 R 6 , NR s R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe, 
COEt, CO-lower alkyl, SCF 3 CN, Cm alkenyl, H, halogens. Cm alkoxy, C 3 . 6 
cycloalkyl, Cm alkyl, aryl, and aryloxy wherein each of the C 3 . 6 cycloalkyl, Ci . 6 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
5 substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, 

OMe, OEt, CONR 5 R 6 , NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 , S0 3 R 7 , 
S0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 -6 alkenyl, H, halogens, Ci. 

4 alkoxy, Cm cycloalkyl, Cm alkyl, and aryl; 

R 4 is a C i -6 alkyl, or C 2 _ 6 alkenyl, or cycloalkyl, or aryl group and each said 
10 group may be optionally substituted by up to four substituents in any position 

independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe, COEt, CO-lower alkyl, 
SCF 3 CN, C 2 -6 alkenyl, H, halogens, Cm alkoxy, C 3 -6 cycloalkyl. Cm alkyl, aryl, and 
aryloxy wherein each of the C 3 ^ cycloalkyl, Cm alkyl, aryl, or aryloxy groups may 
15 be further optionally substituted by up to four substituents in any position 

independently selected from CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 , S0 3 R 7 , S0 2 NR 5 R 6 , COMe, COEt, CO- 
lower alkyl, SCF 3 , CN, C 2-6 alkenyl, H, halogens. Cm alkoxy, Cm cycloalkyl, Cm 
alkyl, and aryl; 

20 R 5 and R 6 are independently a H, or Cm alkyl, or Cm alkenyl, or 

cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF 3 , CC1 3 , Me, N0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH 3 , OCF 3 , SMe, 
COOR 9 , SO 3 R 7 , S0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, 
25 H, halogens, Cm alkoxy, Cm cycloalkyl, Cm alkyl, and aryl wherein each of the 

C 3 .6 cycloalkyl, Cm alkyl, or.aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF 3 , CC1 3 , Me, 
N0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH 3 , OCF 3 , SMe, COOR 7 , 
S 02 NR 8 R 9 , S0 3 R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, H, 

30 halogens. Cm alkoxy, Cm cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5 , 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
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to four substituents in any position independently selected from CF3, CCI3, Me, 

N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 , SO 3 R 7 , NHCOCH 3 , COEt, 
COMe, or halogen; 

R 7 may be independently selected from H or C |. 6 alkyl; 

5 R 8 and R 9 are independently a H, or Ci - 6 alkyl, or C 2 -6 alkenyl, or 

cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
SO 3 R 7 , COEt, NHCOCH 3 , or aryl; 

10 an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 

to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle; 

Ci -6 alkyl moieties can be straight chain or branched; 

optionally substituted Ci -6 alkyl moieties can be straight chain or branched; 

1 5 C 2 . 6 alkenyl moieties can be straight chain or branched; and 

optionally substituted C 2 -6 alkenyl moieties can be straight chain or 
branched; 

with the proviso that said compound is not: 

N-[3 -(4-bromo- 1 -methylpyrazol-3-yl)phenyl] [methylamino] carboxamide, or 

20 N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl] [ {(4-trifluoromethoxy)phenyl } amino] 

carboxamide, or 

N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl] [2-chlorophenyl]carboxamide, or 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][2-chloro-3-pyridyl]carboxamide,or 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][trichloromethyl]carboxamide. 

25 Examples of suitable Ci^ alkyl groups include but art not limited to methyl, 

ethyl, n-propyl, i-propyl, n-butyl, and t-butyl. 

Halogens are typically F, Cl, Br, and I. 

Examples of 5 or 6 membered ring moieties include, but are not restricted 
to, phenyl, furanyl, thienyl, imidazolyl, pyridyl, pyrrolyl, oxazolyl, isoxazolyl, triazolyl, 

30 pyrazolyl, tetrazolyl, thiazolyl and isothiazolyl. Examples of polycycle moieties include, 
but are not restricted to, naphthyl, benzothiazolyl, benzofuranyl, benzimidazolyl, quinolyl, 
isoquinolyl, indolyl, quinoxalinyl, quinazolinyl and benzothienyl. 


WO 99/52927 


PCT /U S99/08 1 68 


55 

Synthetic Approaches 

The compounds disclosed in this invention may be readily prepared according to a 
variety of synthetic manipulations, all of which would be familiar to one skilled in the art. In 
the general syntheses set forth below, the labeled substituents have the same identifications as 
5 set out in the definitions of the compounds above. 

Compounds of general formula (I) can be obtained via a variety of synthetic routes all 
of which would be familiar to one skilled in the art. The reaction of isocyanates with amines is 
a commonly practised method for the formation of ureas (see Org. Syn. Coll. Vol. V, (1973), 
555). Amine (IV), 3-(4-bromo-l-methylpyrazole-3-yl)phenylamine, commercially available 
10 from Maybridge Chemical Company, Catalog No. KM01978, CAS No. 175201-77-1] reacts 
readily with isocyanates (V) in inert solvents such as halocarbons to yield the desired ureas of 
general formula (I) wherein R 1 = R 2 = H: 



Alternatively the amine (IV) can be converted to the corresponding isocyanate (VI) 
15 by the action of phosgene or a suitable phosgene equivalent, e.g. triphosgene, in an inert 
solvent such as a halocarbon in the presence of an organic base such as triethylamine or 
ethyldiisopropylamine. Isocyanate (VI) reacts with amines of general formula (VII), in an 
analogous fashion to that described above for the reaction of (IV) with (V), yielding the 
desired ureas of general formula (I) wherein R 1 = H: 
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Alternatively wherein the isocyanate of general formula (V) is not commercially available it 
can be prepared from the corresponding amine of general formula (VIII) in an analogous procedure 
to that described above for the preparation of (VI). Reaction of these isocyanates with (IV) would 
5 again yield the requisite ureas of general formula (I) wherein R 1 = R 2 = H: 
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H 




R 3 


R 1 = R 2 = H 


Amines of general formula (VII) are also readily converted to activated isocyanate 
equivalents of general formula (IX) by the sequential action of carbonyldiimidazole and methyl 
iodide in tetrahydrofuran and acetonitrile respectively (R.A. Batey et al. Tetrahedron Lett., (1998), 
39, 6267-6270*.) Reaction of (IX) with (IV) in an inert solvent such as a halocarbon would yield the 
requisite ureas of general formula (I) wherein R 1 = H: 
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Amine (IV) may be monomethylated according to the procedure of J. Barluenga et al, J. 
Chem. Soc., Chem. Commun., (1984), 20, 1334-1335, or alkylated according to the procedure of P. 
Marchini et al, J. Org. Chem., (1975), 401231. 3453-3456, to yield compounds of general formula 
(X) wherein R 1 — lower alkyl. These materials may be reacted as above with reagents of general 
formula (V) and (IX) as depicted below: 
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(I) R 1 = lower alkyl 


Compounds of general formula (II) can similarly be obtained via a variety of synthetic 
manipulations, all of which would be familiar to one skilled in the art. The reaction of amine (IV) 
with chloroformates (see Org. Syn. Coll. Vol. IV, (1963), 780) of general formula (XI) in an inert 
solvent such as ether or halocarbon in the presence of a tertiary base such as triethylamine or 
ethyldiisopropylamine readily yields the requisite carbamates of general formula (II) wherein R 1 = 
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H. Analogously amines of general formula (X) react similarly with chloroformates (XI) to yield the 
requisite carbamates of general formula (II) wherein R 1 = lower alkyl: 



R' 


R' 



0(CHj) n R 4 — 



An alternative route employs the ready reaction of an alcohol with an isocyanate. Thus 
isocyanate (VI) described previously reacts readily with alcohols (XII) in an aprotic solvent such as 
ether or chlorocarbon to yield the desired carbamates of general formula (II) wherein R 1 = H: 
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Chloroformates of general formula (XI) not commercially available may be readily prepared 
from the corresponding alcohol (XII) in an inert solvent such as toluene, chlorocarbon or ether by 
the action of excess phosgene (see Org. Syn. Coll. Vol. Ill, (1955), 167): 


HO(CH 2 ) n R 4 


phosgene 


(XII) 



5 

Compounds of general formula (III) can be obtained via a variety of synthetic 
manipulations, all of which would be familiar to one skilled in the art. The reaction of amine (IV) 
with acid chlorides (see Org. Syn. Coll. Vol. V, (1973), 336) of general formula (XIII) to yield the 
desired amides (III) wherein R 1 = H is readily achieved in an inert solvent such as chloroform or 
10 dichloromethane in the presence of an organic base such as triethylamine or ethyldiisopropylamine. 
In an identical fashion amines of general formula (X) would react with acid chlorides (XIII) to yield 
the desired amides (III) wherein R 1 = lower alkyl: 
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R' 



(X) R 1 = lower alkyl 


R 1 



CHj r' = lower alkyl 

(HD 


Alternatively the corresponding acids of general formula (XIV) may be coupled with 
5 dicyclohexylcarbodiimide (DCC)/hydroxybenzotriazole (HOBT) (see W. Konig et al, Chem. Ber., 
(1970), 103. 788) or hydroxybenzotriazole (HOBT)/2-(lH-benzotriazole-l-yl)-l, 1,3,3- 

tetramethyluronium hexafluorophosphate (HBTU) (see M. Bematowicz et al., Tetrahedron Lett., 
(1989), 30, 4645) as condensing agents in dimethylformamide or chloroform to amines (IV) and 
(X) respectively yielding products identical to those described in the previous scheme: 
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DCC/HOBT 

or 

HOBT/HBTU 



(III) 


R 



O 


HO^ ^(CH 2 )R 4 
(XIV) 


(X) R 1 = lower alkyl 


DCC/HOBT 

or 

HOBT/HBTU 



(III) 



WO 99/52927 


PCT/TJS99/08168 


64 

The acids of general formula (XIV) are readily converted to the corresponding acid 
chlorides (XIII) by the action of thionyl chloride or oxalyl chloride in the presence of catalytic 
dimethy lformamide : 


O 



SOCl 2 or (COCI) 2 

DMF cat. 


O 



(XIV) 


(xin) 


5 

A third aspect of the present invention provides a compound of formula (A) or a solvate or 
physiologically functional derivative thereof for use as a therapeutic agent, specifically as a 
modifier of the activity of the serotonin 5-HT2A receptor. Modifiers of the activity of the serotonin 
5-HT2A receptor are believed to be of potential use for the treatment or prophylaxis of CNS, 
10 gastrointestinal, cardiovascular, and inflammatory disorders. Compounds of the formula (A) may 
be administered by oral, sublingual, parenteral, rectal, or topical administration. In addition to the 
neutral forms of compounds of formula (A) by appropriate addition of an ionizable substituent, 
which does not alter the receptor specificity of the compound, physiologically acceptable salts of 
the compounds may also be formed and used as therapeutic agents. Different amounts of the 
1 5 compounds of formula (A) will be required to achieve the desired biological effect. The amount will 
depend on factors such as the specific compound, the use for which it is intended, the means of 
administration, and the condition of the treated individual. A typical dose may be expected to fall in 
the range of 0.001 to 200 mg per kilogram of body weight of the treated individual. Unit does may 
contain from 1 to 200 mg of the compounds of formula (A) and may be administered one or more 
20 times a day, individually or in multiples. In the case of the salt or solvate of a compound of 
formulas (A), the dose is based on the cation (for salts) or the unsolvated compound. 


25 
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A fourth aspect of the present invention provides pharmaceutical compositions 
comprising at least one compound of formula (A) and/or a pharmacologically acceptable 
5 salt or solvate thereof as an active ingredient combined with at least one pharmaceutical 
carrier or excipient. Such pharmaceutical compositions may be used in the treatment of 
clinical conditions for which a modifier of the activity of the serotonin 5-HT 2A receptor is 
indicated. At least one compound of formula (A) may be combined with the carrier in either 
solid or liquid form in a unit dose formulation. The pharmaceutical carrier must be 
10 compatible with the other ingredients in the composition and must be tolerated by the 
individual recipient. Other physiologically active ingredients may be incorporated into the 
pharmaceutical composition of the invention if desired, and if such ingredients are 
compatible with the other ingredients in the composition. Formulations may be prepared by 
any suitable method, typically by uniformly mixing the active compound(s) with liquids or 
1 5 finely divided solid carriers, or both, in the required proportions, and then, if necessary , 
forming the resulting mixture into a desired shape. 

Conventional excipients, such as binding agents, fillers, acceptable wetting agents, 
tabletting lubricants, and disintegrants may be used in tablets and capsules for oral 
administration. Liquid preparations for oral administration may be in the form of solutions, 
20 emulsions, aqueous or oily suspensions, and syrups. Alternatively, the oral preparations may 
be in the form of dry powder which can be reconstituted with water or another suitable liquid 
vehicle before use. Additional additives such as suspending or emulsifying agents, non- 
r aqueous vehicles (including edible oils), preservatives, and flavorings and colorants may be 
added to the liquid preparations. Parenteral dosage forms may be prepared by dissolving the 
25 compound of the invention in a suitable liquid vehicle and filter sterilizing the solution before 
filling and sealing an appropriate vial or ampoule. These are just a few examples of the many 
appropriate methods well known in the art for preparing dosage forms. 

The fifth aspect of the present invention provides for the use of a compound of 
formula (A) in the preparation of a medicament for the treatment of a medical condition for 
30 which a modifier of the activity of the serotonin 5-HT 2A receptor is indicated. 

A sixth aspect of the present invention provides for a method of treatment of a 
clinical condition of a mammal, such as a human, for which a modifier of the activity of the 
serotonin 5 -HT 2a receptor is indicated, which comprises the administration to the mammal 
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of a therapeutically effective amount of a compound of formula (A) or a physiologically 
acceptable salt, solvate, or physiologically functional derivative thereof. 

Experimental Data 

Mass spectra were recorded on a Micromass Platform LC with Gilson HPLC. Infra- 
red spectra were recorded on a Nicolet Avatar 360 FT-IR. Melting points were recorded on 
a Electrothermal IA9200 apparatus and are uncorrected. Proton nuclear magnetic resonance 
spectra were recorded on a Bruker 300MHz machine. Chemical shifts are given with respect 
to tetramethylsilane. In the text the following abbreviations are used; s (singlet), d (doublet), 
t (triplet), m (multiplet) or combinations thereof. Chemical shifts are quoted in parts per 
million (ppm) and with coupling constants in Hertz. 

Thin layer chromatography was carried out using aluminium backed silica plates 
(250pL; GF 254 ). HPLC was recorded either on a HP Chemstation 1100 HPLC using a 
Hichrom 3.5 C18 reverse phase column (50mm x 2.1mm i.d.). Linear gradient elution over 
5 minutes - 95% water (+0.1% TFA) / 5% acetonitrile (+0.05% TFA) down to 5% water / 
95% acetonitrile. Flow rate 0.8mL/min [Method A]; or on a Hichrom 3.5 C18 reverse phase 
column (100mm x 3.2mm i.d.). Linear gradient elution over 11 minutes - 95% water 
(+0.1% TFA) / 5% acetonitrile (+0.05% TFA) down to 5% water / 95% acetonitrile. Flow 
rate lmL/min [Method B]. Samples were routinely monitored at 254nM unless otherwise 
stated. 

All reagents were purchased from commercial sources. 

Experiment 1 

Preparation and Analysis of 1 03487 

N-[3^4-bromo-l-methylpyrazol-3-yl)phenyl][{(4-trifluoromethoxy)phenyl}amino] carboxamide 

This compound is commercially available from Maybridge Chemical Company, 
Catalog No. KM04515. 

Experiment 2 

Preparation and Analysis of 1 1 61 00 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][4-methoxyphenoxy]carboxamide 

To 4-methoxyphenylchloroformate (19mg, O.lOmmol) in CH 2 CI 2 (0.5mL) was added 
dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyTazole (25mg, O.lOmmol) and 
triethylamine (14pL, O.lOmmol) in CH 2 CI 2 (0.5mL). The mixture was stirred for 16 h and 
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concentrated. Chromatography on flash silica (40% EtOAc/hexane) gave the title compound as 
a colourless solid (21 mg, 52%), m.p. 140.3-1 4 1.8°C (EtOAc/hexane). 

IR: v max = 1748, 1592, 1504, 1412, 1190, 835, 764, 676 cm' 1 . MS (ES+): m/z (%) = 
404 (M+H 8l Br, 100), 402 (M+H 79 Br, 90). 

5 ‘H-NMR (CD 3 OD): 8 = 3.80 (3H, s, CH 3 ), 3.81 (3H, s, CH 3 ), 6.91-6.98 (2H, m, ArH), 

7.07-7.18 (3H, m, ArH), 7.42-7.53 (4H, m, ArH). HPLC: retention time 3.28 mins [Method 
A], Tic : Rf0.4 (EtOAc/hexane). 

Experiment 3 

1 o Preparation and Analysis of 1 1 6 1 0 1 

N.[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][4-trifluoromethoxyphenyl]carboxamide 

To 4-(trifluoromethoxy)benzoyl chloride (19pL, 0.12mmol) in CH 2 CI 2 (lmL) was 
added dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (30mg, 
0.12mmol) and triethylamine (17pL, 0.12mmol) in CH 2 CI 2 (0.5mL). The reaction mixture 
15 was stirred for 16 h and concentrated. Chromatography on flash silica (50% EtOAc/hexane) 
gave the title compound as a colourless solid (40mg, 76%), m.p. 138.6-139.6°C 
(EtOAc/hexane). 

MS (ES+): m/z (%) = 442 (M+H 81 Br, 93), 440 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 8 = 3.79 (3H, s, CH 3 ), 7.27 (1H, m, ArH), 7.45-7.60 (3H, m, 

20 ArH), 7.65 (1H, s, ArH), 7.87 (2H, m, ArH), 8.09 (2H, m, ArH), 10.51 (1H, s, NH). 

HPLC: retention time 3.60 min [Method A]. TLC: Rf 0.40 (50% EtOAc/hexane). 

Experiment 4 

Preparation and Analysis of 1 1 6 1 02 

25 N-[3-(4-bromo- 1 -methy lpyrazol-3-y l)pheny 1] [2-th ienyl]carboxam ide 

To thiophene-2-carbonyl chloride (llpL, 0.09mmol) in CH 2 CI 2 (lmL) was added 
dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (25mg, 0.09mmol) 
and triethylamine (14|iL, 0.09mmol) in CH 2 CI 2 (0.5mL). The reaction mixture was stirred 
for 16 h and concentrated. Chromatography on flash silica (50% EtOAc/hexane) gave the 
30 title compound as a colourless solid (24mg, 68%), m.p. 127.8-128.6°C (EtOAc/hexane). 

MS (ES+): m/z (%) = 364 (M+H 81 Br, 96), 362 (M+H 79 Br, 100). 
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‘H-NMR (CD3OD): 6 = 3.81 (3H, s, CH 3 ), 7.19 (2H, m, ArH), 7.48-7.58 (2H. m, 
ArH), 7.68-7.83 (3H, m, ArH), 7.93 (1H, dd, J=1.0, 3.8, ArH). 

HPLC: retention time 3.12 min [Method A]. TLC: Rf 0.30 (30% EtOAc/hexane). 

5 Experiment 5 

Preparation and Analysis of 116115 
N_[3.(4-bromo-l -methylpyrazol-3-yl)phenyl][{(4- 
trifluoromethoxy)phenyl)methyl } amino]carboxamide 

To a stirred solution of triphosgene (12mg, 0.04mmol) in CH 2 C1 2 (0.5mL) was 
10 added dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (30mg, 
0.12mmol) and triethylamine (33pL, 0.24mmol) in CH 2 C1 2 (0.5mL). After 1 h, 4- 
(trifluoromethoxy)benzylamine (23mg, 0.12mmol) was added. The reaction mixture was 
stirred for 16 h and concentrated. Chromatography on flash silica (75%EtOAc/hexane) gave 
the title compound as a colourless solid (38mg, 68%), m.p. 144.6-145. 8°C (EtOAc/hexane). 

15 IR: v ma x = 1626, 1558, 1278, 1 160, 969, 871, 789, 703 cm' 1 . MS (ES+): mJz (%) = 471 

(M+H 81 Br, 91), 469 (M+H 79 Br, 100). 

'H-NMR (CD3OD): 6 = 3.81 (3H, s, CH 3 ), 4.42 (2H, s, CH 2 ), 7.06 (1H, d, J=7.1, ArH), 
7.24 (2H, d, J=8.4, ArH), 7.37-7.52 (6H, m, ArH). HPLC: retention time 3.06 mins [Method 
A], Tic: Rf 0.5 (EtOAc). 

20 

Experiment 6 

Preparation and Analysis of 1 1 6 1 20 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][4-chlorophenyI]carboxamide 

To 4-chlorobenzoyl chloride (15mg, 0.08mmol) in CH 2 C1 2 (lmL) was added 
25 dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (21mg, 0.08mmol) 
and triethylamine (12pL, 0.08mmol) in CH 2 C1 2 (0.5mL). The mixture was stirred for 16 h 
and concentrated. Chromatography on flash silica (50% EtOAc/hexane) gave the title 
compound as a colourless solid (23mg, 72%), m.p. 184.4-1 84.8°C (EtOAc/hexane). 

MS (ES+): mJz (%) = 394 (M+H 8l Br 37 C1, 34), 392 (M+H 79 Br 37 C1 ( 81 Br 35 C1), 
30 100), 390 (M+H 79 Br 35 C1, 67). 

'H-NMR (DMSO d 6 ): 6 = 3.79 (3H, s, CH 3 ), 7.25 (1H, d, J=7.9, ArH), 7.51-7.65 
(3H, m, ArH), 7.69 (1H, s, ArH), 7.90 (2H, m, ArH), 8.00 (2H, m, ArH), 10.51 (1H, s. NH). 

HPLC: retention time 3.40 min [Method A]. TLC: Rf 0.35 (50% EtOAc/hexane). 
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Experiment 7 

Preparation and Analysis of 1 16137 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-[4-(trifluoromethoxy)phenyl]acctamide 

5 

A solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (35mg, 0.14mmol) and 
triethylamine (23pL, 0.17mmol) in DMF (0.5mL) was added in one portion to a stirred 
solution of 4-trifluoromethoxyphenylacetic acid (31mg, 0.14mmol), HBTU (53mg, 
0.14mmol) and HOBT (19mg, 0.14mmol) in DMF (lmL). The mixture was heated at 70°C 
10 for 24 h and then quenched with aqueous sodium bicarbonate solution. Ethyl acetate was 
added and the organic phase separated, washed with water (x3), brine, dried (MgS 04 ) and 
evaporated. Chromatography on flash silica (50%EtOAc/hexane) gave the title compound 
as a colourless solid (43mg, 68%), m.p. 141.2-142.5°C (EtOAc/hexane). 

IR: v max = 1684,1592, 1510, 1253, 1217, 1 157, 987, 798, 700 cm' 1 . 

1 5 MS (ES+): m/z (%) = 456 (M+H 81 Br, 1 00), 454 (M+H 79 Br, 94). 

‘H-NMR (DMSO d$): 5 = 3.72 (2H, s, CH 2 ), 3.75 (3H, s, CH 3 ), 7.17 (1H, d, J=7.7, 
ArH), 7.33 (2H, d, J=8.7, ArH), 7.38-7.51 (3H, m, ArH), 7.62-7.73 (3H, m, ArH), 10.44 
(1H, s,NH). 

HPLC: retention time 3.52 min [Method A]. 

20 

Experiment 8 

Preparation and Analysis of 1 1 61 74 
N-[3-(4-bromo-l-methylpyrazol-3-yi)phenyl]-2-(3-fluorophenyl)acetamide 

25 A mixture of 3-(3-aminophenyl)-4-bromo-I-methylpyrazole (30 mg, 0.12 mmol), 3- 

fluorophenylacetic acid (18 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12 
mmol) and 2-(lH-benzotriazole-l-yl)-l,l,3,3-tetramethyluronium hexafluoro-phosphate (46 
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N, N-Diisopropylethylamine (0.02 
ml, 0.13 mmol) was added and the mixture stirred at room temperature for 16h. The 
30 reaction mixture was then poured into brine and the organic layer washed with further brine, 
dried over magnesium sulphate and then concentrated in vacuo. The crude product was 
purified by column chromatography (ethyl acetate-toluene, 1:1), giving the title compound 
(12 mg, 26%). Rf0.41 (ethyl acetate-toluene, 1:1). 
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HPLC (Method B): retention time 7.07 min (100 %). 5 H (CDC1 3 ) 3.77 (2H, s), 3.83 
(3H, s), 7.02 - 7.20 (4H, m), 7.54 (1H, s), 7.60 - 7.63 (1H, m). MS (AP+): m/z (%) = 390 
(M + H 81 Br, 100), 388 (M + H 79 Br, 100). 

5 Experiment 9 

Preparation and Analysis of 1 1 61 75 
N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(3-methoxyphenyl)acetamide 

A solution of 3-methoxyphenylacetyl chloride (0.02 ml, 0.12 mmol) in 
10 dichloromethane (0.75 ml) was added dropwise at 0 °C to a solution of 3-(3-aminophenyl)- 
4-bromo-l-methylpyrazole (30 mg, 0.12 mmol) and triethylamine (0.02 ml, 0.13 mmol) in 
dichloromethane (0.75 ml). The resulting mixture was stirred at room temperature for 16h 
and then poured into brine. The organic layer was washed with more brine then dried over 
magnesium sulphate and concentrated in vacuo. The crude product was purified by column 
15 chromatography (ethyl acetate-toluene, 1:1), giving the title compound (9 mg, 19 %). Rf 
0.30 (ethyl acetate-toluene, 1:1). 

HPLC (Method B): retention time 8.62 min (97.09 %). 8 H (CDC1 3 ) 3.76 (2H, s), 
3.82 (3H, s), 3.85 (3H, s), 6.84 - 6.90 (3H, m), 7.07 - 7.44 (5H, m), 7.53 (1H, s), 7.60 (1H, 
br s). MS (AP+): m/z (%) = 402 (M + H 81 Br, 100), 400 (M + H 79 Br, 95). 

20 

Experiment 10 

Preparation and Analysis of 116176 
N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(2-fluorophenyl)acetainide 

25 A mixture of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (30 mg, 0.12 mmol), 2- 

fluorophenylacetic acid (18 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12 
mmol) and 2-(lH-benzotriazole-l-yl)-l,l,3,3-tetramethyluronium hexafluoro-phosphate (46 
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N, N-Diisopropylethylamine (0.02 
ml, 0.13 mmol) was added and the mixture stirred at room temperature for 16h. The 
30 reaction mixture was then poured into brine and the organic layer washed with further brine, 
dried over magnesium sulphate and then concentrated in vacuo. The crude product was 
purified by column chromatography (ethyl acetate-toluene, 1:1), giving the title compound 
(15 mg, 32 %). Rf 0.52 (ethyl ac. .e-toluene, 1:1). 
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HPLC (Method B): retention time 7.28 min (100 %). 8 h (CDCI3) 3.79 (2H, s), 3.83 
(3H, s), 7.11 - 7.23 (3H, m), 7.30 - 7.55 (6H, m), 7.61 - 7.64 (1H, m). MS (AP+): m/z (%) 
= 390 (M + H 81 Br, 100), 388 (M + H 79 Br, 100). 


5 Experiment 1 1 

Preparation and Analysis of 1 1 6 1 77 
N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]-2-(4-nitrophenyl)acetamide 

A mixture of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (30 mg, 0.12 mmol), 4- 
10 nitrophenylacetic acid (22 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 0.12 
mmol) and 2-(lH-benzotriazole-l-yl)-l,l,3,3-tetramethyluronium hexafluorophosphate (46 
mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N, N-Diisopropylethylamine (0.02 ml, 
0.13 mmol) was added and the mixture stirred at room temperature for 16h. The reaction 
mixture was then poured into brine and the organic layer washed with further brine, dried over 
15 magnesium sulphate and then concentrated in vacuo. The crude product was purified by 
column chromatography (ethyl acetate-toluene, 1:1), giving the title compound (9 mg, 18 %). 
Rf 0.19 (ethyl acetate-toluene, 1:1). 

HPLC (Method B): retention time 7.22 min (94.30 %). 5 h (CDCI 3 ) 3.83 (3H, s), 3.87 
(2H, s), 7.18 - 7.23 (1H, m), 7.42 - 7.65 (7H, m), 8.22 - 8.30 (2H, m). MS (AP+): m/z (%) = 
20 41 7 (M + H 81 Br, 100), 415 (M + H 79 Br, 100). 

Experiment 12 

Preparation and Analysis of 1 1 61 78 
N-[3-(4-bromo- 1 -methylpyrazol-3-yl)pheny l]-2-(2-methoxypheny l)acetamide 
25 

A mixture of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (30 mg, 0.12 mmol), 2- 
methoxyphenylacetic acid (20 mg, 0.12 mmol), 1-hydroxybenzotriazole hydrate (16 mg, 
0.12 mmol) and 2-(lH-benzotriazole-l-yl)-l,l,3,3-tetramethyluronium hexafluoro- 
phosphate (46 mg, 0.12 mmol) were dissolved in chloroform (1.5 ml). N, N-Diisopropyl- 
30 ethylamine (0.02 ml, 0.13 mmol) was added and the mixture stirred at room temperature for 
16h. The reaction mixture was then poured into brine and the organic layer washed with 
further brine, dried over magnesium sulphate and then concentrated in vacuo. The crude 
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product was- purified by column chromatography (chloroform-methanol, 99:1), giving the 
title compound (18 mg, 38 %) as a colourless solid. Rf 0.65 (chloroform-methanol, 98:2). 

HPLC (Method B): retention time 7.16 min (100 %). 5h (CDCb) 3.76 (2H, s), 3.83 
(3H, s), 3.98 (3H, s), 6.97 - 7.06 (2H, m), 7.1 1-7.16 (1H, m), 7.31 - 7.50 (4H, m). 7.53 
5 (1H, s), 7.57 - 7.60 (1H, m), 7.91 (1H, br s). MS (AP-): m/z (%) = 400 (M - H 81 Br, 90), 

398 (M-H 79 Br, 100). 


Experiment 13 

Preparation and Analysis of 1 16192 

10 {[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(l,l-dimethylethoxy)carboxamide 

To di-/er/-butyl dicarbonate (36mg, 0.17mmol) in methanol (lmL) was added 
dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (42mg, 0.17mmol) in 
methanol (lmL). The mixture was stirred for 16 h and concentrated. Chromatography on 
flash silica (40%EtOAc/heaxne) gave the title compound as a colourless solid (29mg, 49%) 

15 (EtO Ac/hexane). 

MS (CI-): m/z (%) = 352 (M-H 81 Br, 100), 350 (M-H 79 Br, 96). 

'H-NMR (DMSO ds): 6 = 1.46 (9H, s, 3xCH 3 ), 3.73 (3H, s, CH 3 ), 7.07 (1H, m, 
ArH), 7.42 (1H, t, 1=1.1, ArH), 7.53-7.60 (2H, m, ArH), 7.64 (1H, s, ArH), 9.57 (1H, s, 
NH). 

20 HPLC: retention time 7. 1 5 min [Method B]. 

One or the other (as indicated) of the two following synthetic protocols was used to 
generate each of the compounds below: 

Protocot Ar- 

25 To an isocyanate (lmmol) in CH 2 C1 2 (4mL) was added dropwise a solution of 3-(3- 

aminophenyl)-4-bromo- 1 -methy lpyrazole (lmmol) in CH 2 C1 2 (4mL). The mixture was 
stirred for 16 hours and concentrated. Chromatography on flash silica (20%-80% 
EtOAc/hexane) followed by recrystallisation gave the pure urea. 

Protocol B: 

30 To a stirred solution of triphosgene (0.33mmol) in CH 2 C1 2 (4mL) was added 

dropwise a solution of 3-(3-aminophenyl)-4-bromo-l-methylpyrazole (lmmol) and 
triethylamine (2mmol) in CH 2 C1 2 (4mL). After 1 hour, an aniline was added (lmmol). The 
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reaction mixture was stirred for 16 hours and concentrated. Chromatography on flash silica 
(20%-80%EtOAc/hexane) followed by recrystallisation gave the pure urea. 


5 Experiment 14 

Preparation and Analysis of 1 16079 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][(4-methylthiophenyl)amino]carboxamide 

[Protocol A] - 4-(methylthio)phenyl isocyanate 
10 colourless solid (EtO Ac/hexane) 

MS (ES+): m/z (%) = 419 (M+H 81 Br, 100), 417 (M+H 79 Br, 94). 

‘H-NMR (MeOH cU): 6 = 2.42 (3H, s, SCH 3 ), 3.81 (3H, s, NCH 3 ), 7.06 (1H, m, 
ArH), 7.22 (2H, m, ArH), 7.37 (2H, m, ArH), 7.42-7.61 (4H, m, ArH). 

HPLC: retention time 3.35 min [Method A]. 

15 

Experiment 15 

Preparation and Analysis of 1 16081 

N-[3-(4-bromo-l-methylpyrazol-3-yl)phenyl][ (4-chlorophenyl)amino]carboxamide 

20 [Protocol A] - 4-chlorophenyl isocyanate 
colourless solid (EtOAc/hexane) 

MS (ES+): mJz (%) = 409 (M+H 8, Br 37 C1, 19), 407 (M+H 79 Br 37 C1 ( 8l Br 35 C1), 
100), 405 (M+H 79 Br 35 C1, 81). 

'H-NMR (MeOH cL,): 6 = 3.81 (3H, s, CH 3 ), 7.07 (1H, m, ArH), 7.23 (2H, m, ArH), 
25 7.36-7.60 (6H, m, ArH). 

HPLC: retention time 3.42 min [Method A]. 

Experiment 16 

30 Preparation and Analysis of 1 1 6082 

{ [3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl]amino}-N-(4-fluorophenyl)carboxamide 
[Protocol A] - 4-fluorophenyl isocyanate 
colourless solid (EtOAc/hexane) 
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MS (ES+): m/z (%) = 391 (M+H 8l Br, 96), 389 (M+H 79 Br, 100). 

‘H-NMR (MeOH d 4 ): 6 = 3.81 (3H, s, CH 3 ), 6.93-7.1 1 (3H, m, ArH), 7.37-7.61 (6H, 
m. ArH). 

HPLC: retention time 3.1 1 min. 

Experiment 17 

Preparation and Analysis of 1 16087 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[2-(trifluoromethoxy)phenyl]carboxamide 
[Protocol A] - 2-(trifluoromethoxy)phenyl isocyanate 
colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 457 (M+H 81 Br, 100), 455 (M+H 79 Br, 95). 

'H-NMR (DMSO d 6 ): 8 = 3.79 (3H, s, CH 3 ), 7.06-7.18 (2H, m, ArH), 7.38-7.49 
(2H, m, ArH), 7.51-7.62 (2H, m, ArH), 7.65 (1H, m, ArH), 7.71 (1H, s, ArH), 8.24 (1H, dd, 
J=l.l, 8.2, ArH), 8.56 (1H, s,NH), 9.49 (1H, s,NH). 

HPLC: retention time 3.40 min. 

Experiment 1 8 

Preparation and Analysis of 1 1 6089 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-nitrophenyl)carboxamide 

[Protocol A] - 2-nitrophenyl isocyanate 
yellow solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 418 (M+H 81 Br, 98), 416 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 8 = 'H-NMR (DMSO d 6 ): □ = 3.79 (3H, s, NCH 3 ), 7.14 (1H, 
m, ArH), 7.24 (1H, m, ArH), 7.50 (1H, t, J=7.7, ArH), 7.60 (2H, m, ArH), 7.67 (1H, s, 
ArH), 7.71 (1H, s, ArH), 8.10 (1H, m, ArH), 8.29 (1H, m, ArH), 9.65 (1H, s, NH), 10.09 
(1H, s, NH). 

HPLC: retention time 3.10 min [Method A], 

Experiment 19 

Preparation and Analysis of 1 16091 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(4-methoxyphenyl)carboxamide 

[Protocol A] - 4-methoxyphenyl isocyanate 
colourless solid (EtOAc/hexane) 
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MS (ES+); m/z (%) = 403 (M+H 81 Br, 100), 401 (M+H 79 Br, 96). 

'H-NMR (DMSO d 6 ): 8 = 3.71 (3H, s, OCH 3 ), 3.79 (3H, s, NCH 3 ), 6.87 (2H, d, 
J=8.9, ArH), 7.06 (1H, d, J=7.5, ArH), 7.39 (2H, d, J=8.9, ArH), 7.45-7.61 (3H, m, ArH), 
7.65 (1H, s, ArH), 8.52 (1H, s, NH), 8.84 (1H, s, NH). 

5 HPLC: retention time 3.08 min. 

Experiment 20 

Preparation and Analysis of 1 1 6092 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-methylphenyl)carboxamide 
10 [Protocol A] - o-tolyl isocyanate 

colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 387 (M+H 8l Br, 94), 385 (M+H 79 Br, 1 00). 

'H-NMR (MeOH <U): 8 = 2.29 (3H, s, CH 3 ), 3.81 (3H, s, NCH 3 ), 7.03 (1H, dt, 
J=l. 1,7.5, ArH), 7.09 (1H, dt, J=l.l, 7.5, ArH), 7.13-7.22 (2H, m, ArH), 7.45 (1H, t, J=7.9, 
1 5 ArH), 7.49-7.57 (2H, m, ArH), 7.60-7.68 (2H, m, ArH). 

HPLC: retention time 2.96 min. 

Experiment 21 

Preparation and Analysis of 1 16097 

20 {[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[4-(trifluoromethyl)phenyl]carboxamide 

[Protocol A] - 4-(trifluoromethyl)phenyl isocyanate 
colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 441 (M+H 81 Br, 94), 439 (M+H 79 Br, 100). 

'H-NMR (MeOH cU): 8 = 3.82 (3H, s, CH 3 ), 7.04-7.16 (3H, m, ArH), 7.20-7.47 (6H, 
25 m, ArH). 

HPLC: retention time 3.56 min. 

Experiment 22 

Preparation and Analysis of 1 16105 

30 {[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3-chlorophenyl)carboxaniide 

[Protocol A] - 3-chlorophenyl isocyanate 
colourless solid (EtOAc/hexane) 
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MS (ES+): m/z (%) = 409 (M+H 81 Br 37 C1, 26), 407 (M+H 79 Br 37 C1 ( 8l Br 35 C1). 
100), 405 (M+H 79 Br 35 C1, 70). 

'H-NMR (MeOH d 4 ): 5 = 3.81 (3H, s, NCH 3 ), 7.04 (1H, m, ArH), 7.10 (1H, m, 
ArH), 7.28 (2H, m, ArH), 7.47 (1H, t, J=7.8, ArH), 7.55 (1H, m, ArH), 7.63 (1H. m. ArH), 
5 7.68 (1H, s, ArH), 7.73 (1H, m, ArH), 9.04 (2H, s, NH). 

HPLC: retention time 3.20 min [Method A]. 

Experiment 23 

Preparation and Analysis of 1 16108 

10 {[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-chlorophenyl)carboxamide 

[Protocol A] - 2-chlorophenyl isocyanate 
colourless solid (EtO Ac/hexane) 

MS (ES+): m/z (%) = 409 (M+H 81 Br 37 C1, 24), 407 (M+H 79 Br 37 C1 ( 81 Br 35 C1), 
100), 405 (M+H 79 Br 35 C1, 72). 

15 'H-NMR (MeOH d,): 6 = 3.81 (3H, s, NCH 3 ), 7.03 (1H, m, ArH), 7.11 (1H, m, 

ArH), 7.28 (1H, m, ArH), 7.35-7.53 (3H, m, ArH), 7.55 (1H, s, ArH), 7.62 (1H, m, ArH), 
8.11 (1H, m, ArH). 

HPLC: retention time 3.13 min. 


20 


25 


Experiment 24 

Preparation and Analysis of 1 1 6 1 1 0 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[4-(methylethyl)phenyl]carboxamide 
[Protocol A] - 4-isopropylphenyl isocyanate 
colourless solid (THF/hexane) 

MS (ES+): m/z (%) = 415 (M+H 81 Br, 100), 413 (M+H 79 Br, 92). 

'H-NMR (MeOH <L,): 5 = 1.23 (6H, d, J=6.8, 2xCH 3 ), 2.86 (1H, septet, J=6.8, CH), 
3.82 (3H, s, NCH 3 ), 7.09 (1H, m, ArH), 7.16 (2H, d, J=7.6, ArH), 7.31 (2H, d, J=7.6, ArH), 
7.42-7.51 (2H, m, ArH), 7.54 (1H, s, ArH), 7.59 (1H, m, ArH). 

HPLC: retention time 3.66 min. 


30 

Experiment 25 

Preparation and Analysis of 1 161 1 1 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3-methoxyphenyl)carboxamide 
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[Protocol A] - 3-methoxyphenyl isocyanate 
colourless solid (EtO Ac/hexane) 

MS (ES+): m/z (%) = 403 (M+H 8l Br, 100), 401 (M+H 79 Br, 96). 

‘H-NMR (MeOH d,): 8 = 3.73 (3H, s, OCH 3 ), 3.81 (3H, s, NCH 3 ), 6.59 (1H, m, 
5 ArH), 6.91 (1H, m, ArH), 7.08 (1H, m, ArH), 7.14 (2H, m, ArH), 7.39-7.61 (4H, m, ArH). 
HPLC: retention time 2.90 min. 

Experiment 26 

Preparation and Analysis of 1 161 12 

10 {[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3-methylphenyl)carboxamide 

[Protocol A] - m-tolyl isocyanate 

colourless solid (EtO Ac/hexane) 

MS (ES+): m/z (%) = 387 (M+H 8l Br, 100), 385 (M+H 79 Br, 96). 

'H-NMR (DMSO d 6 ): 8 = 2.26 (3H, s, CH 3 ), 3.76 (3H, s, NCH 3 ), 6.79 (1H, m, 
15 ArH), 7.06-7.22 (3H, m, ArH), 7.29 (1H, m, ArH), 7.43-7.62 (3H, m, ArH), 7.68 (1H, s, 
ArH), 8.65 (1H, s, NH), 8.89 (1H, s, NH). 

HPLC: retention time 3.05 min [Method A]. 

Experiment 27 

20 Preparation and Analysis of 1 1 6 1 1 3 

{ [3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl]amino}-N-methy l-N-[4- 
(trifluoromethoxy)phenyl]carboxamide 
[Protocol B] - AT-methyl-4-(trifluoromethoxy)aniline 
pale yellow solid (EtO Ac/hexane) 

25 MS (ES+): m/z (%) = 471 (M+H 81 Br, 88), 469 (M+H 79 Br, 1 00). 

‘H-NMR (MeOH di): 8 = 3.35 (3H, s, NCH 3 ), 3.81 (3H, s, NCH 3 ), 7.09 (1H, m, 
ArH), 7.25-7.5 1 (8H, m, ArH). 

HPLC: retention time 3.56 min [Method A]. 

30 Experiment 28 

Preparation and Analysis of 1 161 19 

N-[4-(ter/-butyl)phenyl]{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}carboxamide 

[Protocol B] - 4-rm-butylaniline 
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colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 429 (M+H 8l Br, 98), 427 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 8 = 1.27 (9H, s, 3xCH 3 ), 3.79 (3H, s, NCH 3 ), 7.07 (1H, d, 
J=7.5, ArH), 7.29 (2H, d, J=8.7, ArH), 7.37 (2H, d, J=8.7, ArH), 7.45 (1H, t, J=7.5, ArH), 

5 7.51-7.60 (2H, m, ArH), 7.66 (1H, s, ArH), 8.65 (1H, s, NH), 8.83 (1H, s, NH). 

HPLC: retention time 3.77 min. 

Experiment 29 

Preparation and Analysis of 1 16122 

1 0 N-[4-(d imethylamino)phenyl] { [3-(4-bromo- 1 -methy lpyrazol-3-yl)phenyl]amino } carboxamide 

[Protocol B] - ALA/-dimethyl-p-phenylenediamine 

colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 416 (M+H sl Br, 96), 414 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 8 = 2.86 (6H, s, NCH 3 ), 3.80 (3H, s, NCH 3 ), 6.80 (2H, m, 

15 ArH), 7.09 (1H, d, J=7.7, ArH), 7.28 (2H, m, ArH), 7.42 (1H, t, J=7.8, ArH), 7.52 (1H, m, 
ArH), 7.59 (1H, s, ArH), 7.67 (1H, s, ArH), 8.45 (1H, s, NH), 8.75 (1H, s, NH). 

HPLC: retention time 2.07 min [Method A]. 

Experiment 30 

20 Preparation and Analysis of 1 1 6 1 38 

N-(3,5-dichloro-4-methylpheny!){[3-(4-bromo-l-methylpyrazol-3-yl)pheny!]amino} carboxamide 
[Protocol B] - 3,5-dichloro-4-methylphenylamine 

colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 457 (M+H, 35), 455 (M+H, 100), 453 (M+H, 65). 

25 'H-NMR (DMSO cU): 8 = 2.32 (3H, s, CH 3 ), 3.79 (3H, s, NCH 3 ), 7.1 1 (1H, d, J=7.4, 

ArH), 7.46 (1H, t, J=7.8, ArH), 7.50-7.64 (4H, m, ArH), 7.68 (1H, s, ArH), 9.02 (1H, s, 
NH), 9.09 (1H, s, NH). 

HPLC: retention time 3.66 min. 

30 Experiment 3 1 

Preparation and Analysis of 1 16139 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-[4-(trifluoromethylthio)phenyl]carboxamide 
[Protocol B] - 4-(trifluoromethylthio)aniline 
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colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 473 (M+H 81 Br, 100), 471 (M+H 79 Br, 94). 

' H-NMR (DMSO d 6 ): 5 = 3.81 (3H, s. NCH 3 ), 7.1 1 (1H, d, J=7.5, ArH), 7.47 (1H, t, 
J=7.9, ArH), 7.51-7.63 (6H, m, ArH), 7.66 (1H, s, ArH), 9.03 (1H, s, NH), 9.16 (1H, s, 
5 NH). 

HPLC: retention time 3.76 min. 

Experiment 32 

Preparation and Analysis of 1 16141 

1 0 { [3-(4-bromo- 1 -methylpyrazol-3-yl)pheny!]amino}-N-(cyclohexyl)carboxamide 

[Protocol B] - cyclohexylamine 

colourless solid, m.p. 1 55.5-1 56.3°C (EtOAc/hexane). 

MS (ES+): m/z (%) = 379 (M+H 8l Br, 93), 377 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 6 = 1.07-1.34 (5H, m, 5xCH), 1.52 (1H, m, CH), 1.63 (2H, m, 
15 2xCH), 1.76 (2H, m, 2xCH), 3.48 (1H, m, NCH), 3.74 (3H, s, CH 3 ), 6.15 (1H, d, J=7.8, 
ArH), 6.98 (1H, d, J=7.5, ArH), 7.32-7.43 (2H, m, ArH), 7.51 (1H, m, NH), 7.62 (1H, s, 
ArH), 8.50 (1H, s,NH). 

HPLC: retention time 3.16 min [Method A]. 

TLC: retention factor 0.35 (50% EtOAc/hexane). 

20 

Experiment 33 

Preparation and Analysis of 1 16143 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(phenylmethyl)carboxamide 
[Protocol B} - benzylamine 

25 colourless solid, m.p. 144.5-146.2°C (EtOAc/hexane). 

IR: Dmax = 1622, 1565, 1467, 1374, 1239, 973, 802, 752, 695 cm' 1 . 

MS (ES+): m/z (%) = 387 (M+H 81 Br, 89), 385 (M+H 79 Br, 100). 

'H-NMR (CD 3 OD): 5 = 3.81 (3H, s, CH 3 ), 4.40 (2H, s, CH 2 ), 7.05 (1H, m, ArH), 
7.19-7.51 (9H, m, ArH). 

30 HPLC: retention time 3.06 min [Method A].a 
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Experiment 34 

Preparation and Analysis of 1 1 6144 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-fluorophenyl)carboxamide 
[Protocol A] - 2-fluorophenyl isocyanate 
5 colourless solid (DCM/hexane) 

MS (ES+): m/z (%) = 391 (M+H 8l Br, 100), 389 (M+H 79 Br, 90). 

'H-NMR (MeOH d 4 ): 8 = 3.79 (3H, s, NCH 3 ), 7.00-7.11 (4H, m, ArH), 7.40-7.56 
(3H, m, ArH), 7.61 (1H, m, ArH), 8.09 (1H, m, ArH). 

HPLC: retention time 3.01 min. 


Experiment 35 

Preparation and Analysis of 1 1 6145 

2-({[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}carbonylamino)benzamide 
[Protocol B] - 2-aminobenzamide 
15 colourless solid (EtO Ac/hexane) 

MS (ES+): m/z (%) = 399 (M+H -17 81 Br, 100), 397 (M+H - 17 79 Br, 94). 

'H-NMR (DMSO d 6 ): 8 = 3.79 (3H, s, NCH 3 ), 6.93-7.10 (2H, m, ArH), 7.45 (2H, t, 
J=7.8, ArH), 7.59-7.72 (5H, m, ArH), 8.22 (2H, m), 9.92 (1H, s, NH), 10.69 (1H, s, NH). 
HPLC: retention time 2.88 min. 


Experiment 36 

Preparation and Analysis of 1 16147 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(4-cyanophenyl)carboxamide 
[Protocol B] - 4-aminobenzonitrile 
25 colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 398 (M+H 8l Br, 100), 396 (M+H 79 Br, 96). 

'H-NMR (MeOH d,): 8 = 3.81 (3H, s, NCH 3 ), 7.12 (1H, m, ArH), 7.46-7.57 (3H, m, 
ArH), 7.62-7.69 (5H, m, ArH). 

HPLC: retention time 3.12 min. 

30 


Experiment 37 

Preparation and Analysis of AR1 16148 
{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(2-cyanophenyl)carboxamide 
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[Protocol B] - 2-aminobenzonitrile 

colourless solid (EtOAc/hexane) 

MS (ES+): m/z (%) = 398 (M+H 8l Br, 95), 396 (M+H 79 Br, 100). 

'H-NMR (CDCI3): 5 = 3.79 (3H, s, CH 3 ), 7.13-7.28 (2H, m, ArH), 7.49 (1H, t, 
5 J=7.8, ArH), 7.57 (1H, m, ArH), 7.62 (1H, m, ArH), 7.65-7.71 (2H, m, ArH), 7.78 (1H, m, 

ArH), 8.07 (1H, d, J=8.6, ArH), 8.83 (1H, s, NH), 9.62 (1H, s, NH). 

HPLC: retention time 3.05 min [Method A]. 


Experiment 38 

1 0 Preparation and Analysis of 1 1 6 1 82 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyi]amino}-N-(4-fluorophenylmethyl)carboxamide 
[Protocol B] - 4-fluorobenzylamine 

colourless solid, m.p. 1 85.5-1 86.6°C (EtOAc/hexane). 

MS (ES+): m/z (%) = 405 (M+H 81 Br, 97), 403 (M+H 79 Br, 100). 

15 'H-NMR (DMSO d 6 ): 6 = 3.75 (3H, s, CH 3 ), 4.28 (2H, d, J=6.0, CH 2 ), 6.73 (1H, t, 

J=5.9, NH), 7.01 (1H, d, J=7.5, ArH), 7.10-7.18 (2H, m, ArH), 7.27-7.41 (4H, m, ArH), 
7.56 (1H, s, ArH), 7.62 (1H, s, ArH), 8.82 (1H, s, NH). 

HPLC : retention time 3.10 min [Method A] . 

TLC: retention factor 0.25 (50% EtOAc/hexane). 


20 

Experiment 39 

Preparation and Analysis of 1 1 6 1 83 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(3,4-dimethoxyphenylmethyl)carboxamide 
[Protocol B] - 3,4-dimethoxybenzylamine 
25 colourless solid, m.p. 174.9-175.5°C (EtOAc/hexane). 

MS (C1+): m/z (%) = 447 (M+H 81 Br, 100), 445 (M+H 79 Br, 92). 

'H-NMR (DMSO d 6 ): 8 = 3.71 (3H, s, CH 3 ), 3.73 (3H, s, CH 3 ), 3.76 (3H, s, CH 3 ), 
4.22 (2H, d, J=5.8, CH 2 ), 6.62 (1H, t, J=5.7, NH), 6.80 (1H, m, ArH), 6.89 (2H, m, ArH), 
6.98 (1H, m, ArH), 7.36-7.51 (3H, m, ArH), 7.63 (1H, s, ArH), 8.76 (1H, s, NH). 

30 HPLC: retention time 2.86 min [Method A]. 

TLC: retention factor 0.20 (50% EtOAc/hexane). 
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Experiment 40 

Preparation and Analysis of 1 16184 

{[3-(4-bromo-l -methylpyrazol-3-yl)phenyl]amino}-N-(3,4,5-trimethoxyphenylmethyl)carboxamide 
[Protocol B] - 3,4,5-trimethoxybenzylamine 
5 colourless solid (EtOAc/hexane). 

MS (CI+): m/z (%) = 477 (M+H 81 Br, 100), 475 (M+H 79 Br, 95). 

'H-NMR (DMSO d 6 ): 6 = 3.63 (3H, s, OCH 3 ), 3.75 (9H, s, 3xCH 3 ), 4.21 (1H, d, 
J=5.9, CH 2 ), 6.61 (2H, s, ArH), 6.65 (1H, t, J=5.9, NH), 6.99 (1H, m, ArH), 7.40 (1H, t, 
J=7.7, ArH), 7.45 (1H, m, ArH), 7.56 (1H, m, ArH), 7.64 (1H, s, ArH), 8.77 (1H, s, NH). 

10 HPLC: retention time 5.91 min [Method B]. 

TLC: retention factor 0.50 (50% EtOAc/hexane). 

Experiment 41 

Preparation and Analysis of 1 16185 

15 {[3-(4-bromo-l-methylpyrazoI-3-yl)phenyl]amino}-N-(2-methylphenylmethyl)carboxamide 

[Protocol B] - 2-methylbenzylamine 

colourless solid (EtOAc/hexane). 

MS (CI+): m/z (%) = 401 (M+H 81 Br, 96), 399 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 5 = 2.28 (3H, s, CH 3 ), 3.76 (3H, s, NCH 3 ), 4.28 (1H, d, J=5.8, 
20 CH 2 ), 6.60 (1H, t, J=5.8, NH), 7.01 (1H, m, ArH), 7.15 (3H, m, ArH), 7.24 (1H, m, ArH), 

7.38-7.50 (2H, m, ArH), 7.57 (1H, m, ArH), 7.65 (1H, s, ArH), 8.77 (1H, s, NH). 

HPLC: retention time 2.74 min [Method A]. 

TLC: retention factor 0.20 (50% EtOAc/hexane). 

25 Experiment 42 

Preparation and Analysis of 1 16189 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyl]amino}-N-(4-methoxyphenylmethyl)carboxamide 
[Protocol B] - 4-methoxybenzylamine 
colourless solid (EtOAc/hexane). 

MS (CI+): m/z (%) = 417 (M+H 81 Br, 94), 415 (M+H 79 Br, 100). 

‘H-NMR (DMSO d 6 ): 6 = 3.72 (3H, s, CH 3 ), 3.77 (3H, s, NCH 3 ), 4.22 (1H, d, J=5.9, 
CH 2 ), 6.62 (1H, t, J=5.9, NH), 6.90 (2H, d, J=8.8, ArH), 7.00 (1H, m, ArH), 7.23 (2H, d. 


30 
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J=8.8. ArH), 7.39 (1H, t, J=7.8, ArH), 7.43 (1H, m. ArH), 7.56 (1H, m. ArH), 7.64 (1H, s. 
ArH), 8.73(1 H, s, NH). 

HPLC: retention time 6.41 min [Method B]. 

TLC: retention factor 0.25 (50% EtOAc/hexane). 

5 

Experiment 43 

Preparation and Analysis of 1 16194 

{[3-(4-bromo-l-methylpyrazol-3-yl)phenyI]amino}-N-[2-(4-methoxy)phenylethyl]carboxamide 
[Protocol B] - 2-(4-methoxyphenyl)ethylamine 
10 colourless solid (EtOAc/hexane). 

MS (ES+): m/z (%) = 431 (M+H 8l Br, 95), 429 (M+H 79 Br, 100). 

'H-NMR (DMSO d 6 ): 5 = 2.68 (2H, t, J=7.1, CH 2 ), 3.31 (2H, m, CH 2 ), 3.71 (3H, s, 
CHj), 3.77 (3H, s, CH 3 ), 6.16 (1H, t, J=5.8, NH), 6.87 (2H, d, J=8.6, ArH), 6.99 (1H, dt, 
J=1 .4, 7.3, ArH), 7.16 (2H, d, J=8.6, ArH), 7.33-7.48 (2H, m, ArH), 7.52 (1H, m, ArH), 
15 7.63 (lH,s, ArH), 8.71 (lH,s,NH). 

HPLC: retention time 6.62 min [Method B]. 

An important point that can be derived from the foregoing data is that by using a 
constitutively activated form of the receptor in the direct identification of candidate 
20 compounds, the selectivity of the compounds is exceptional: as those in the art appreciate, the 
homology between the human 5HT2A and 5HT2C receptors is about 95%, and even with such 
homology, certain of the directly identified compounds evidence a 4-order-of-magnitude 
(10,000-fold) selectivity separation (116100). This is important for pharmaceutical 
compositions in that such selectivity can help to reduce side-effects associated with interaction 
25 of a drug with a non-target receptor. 

Different embodiments of the invention will consist of different constitutively activated 
receptors, different expression systems, different assays, and different compounds. Those 
skilled in the art will understand which receptors to use with which expression systems and 
assay methods. All are considered within the scope of the teaching of this invention. In 
30 addition, those skilled in the art will recognize that various modifications, additions, 
substitutions, and variations to the illustrative examples set forth herein can be made without 
departing from the spirit of the invention and are, therefore, considered within the scope of the 
invention. 
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CLAIMS 

We claim: 

1. A cDNA encoding a constitutively active, non-endogenous version, of a human 

5 5HT2C serotonin receptor comprising SEQ. ID NO. 28. 

2 . A constitutively active non-endogenous human 5HT2C serotonin receptor encoded 
by the cDNA of SEQ ID NO. 28 comprising SEQ ID NO. 29. 

3 . A cDNA encoding a constitutively active, non-endogenous version, of a human 
5HT2 a serotonin receptor comprising SEQ. ID NO. 30. 

10 4 . A constitutively active non-endogenous human 5HT2 a serotonin receptor encoded 

by the cDNA of SEQ ID NO. 30 comprising SEQ ID NO. 3 1 . 

5 . A cDNA encoding a constitutively active, non-endogenous version, of a human 
5HT2A serotonin receptor comprising SEQ. ID NO. 32. 

6 . A constitutively active non-endogenous human 5HT2 a serotonin receptor encoded 

15 by the cDNA of SEQ ID NO. 32 comprising SEQ ID NO. 33. 

7. A method for identifying whether a candidate compound is an inverse agonist to a 
non-endogenous human 5 HT 2 serotonin receptor comprising the steps of: 

a. contacting the candidate compound with a non-endogenous human 5 HT 2 
serotonin receptor ; and 

20 b. determining, by measurement of a second messenger response whether said 

compound is an inverse agonist. 

8. The method of claim 7 in which the non-endogenous human 5HT2 serotonin 
receptor comprises SEQ ID NO. 29. 

9. The method of claim 7 in which the non-endogenous human 5HT2 serotonin 

25 receptor comprises SEQ ID NO. 3 1 . 

10. The method of claim 7 in which the non-endogenous human 5HT2 serotonin 
receptor comprises SEQ ID NO. 33. 

11. An inverse agonist identified by the method of claim 7. 

1 2. A reagent for screening compounds to determine whether the compounds are inverse 

30 agonists at human 5 HT 2 serotonin receptors comprising a membrane fraction from 

mammal ian cells transfected with and expressing a cDNA encoding for a 
constitutively active, non-endogenous version, of a human 5 HT 2 serotonin receptor 
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in which the constitutively active non-endogenous human 5HT 2 receptor is 
expressed on the cell surface. 

13. A reagent for screening compounds to determine whether the compounds are inverse 
agonists at human 5HT 2 serotonin receptors comprising mammalian cells which 
produce a second messenger response, transfected with and expressing a cDNA 
encoding for a constitutively active, non-endogenous version, of a human 5HT 2 
serotonin receptor in which the constitutively active non-endogenous human 5HT 2 
receptor is expressed on the cell surface. 

14. A method for modulating by inverse agonism the activity of a human 5HT 2A 
serotonin receptor by contacting the receptor with a compound of formula: 



Wherein: 

W is lower alkyl (Cm), or halogen; 

V is lower alkyl (Cm), or halogen; 

X is either Oxygen or Sulfur; 

Y is NR 2 R 3 , or (CH 2 ) m R 4 , or 0(CH 2 ) n R 4 ; 
Z is lower alkyl (Cm); 

m = 0 -4 
n =0-4 

R 1 is H or lower alkyl (Cm); 

R 2 is H or lower alkyl(CM); 
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R 3 and R 4 are independently aC^ alkyl, or Cm alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
substituents in any position independently selected from CFj, CCI3, Me, N 0 2 , OH, 
OMe, OEt, CONR s R 6 , NR 5 R 6 , OCFj, SMe, COOR 7 , S 0 2 NR 5 R 6 , SO3R 7 , COMe, 
5 COEt, CO-lower alkyl, SCF3CN, Cm alkenyl, H, halogens, C M alkoxy, C3.6 

cycloalkyl, Cm alkyl, aryl, and aryloxy wherein each of the C 3 ^ cycloalkyl, Cm 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF3, CCI3, Me, N 0 2 , OH, 
OMe, OEt, CONR 5 R 6 , NR S R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , 
10 S 0 2 NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl, H, halogens, Ci. 

4 alkoxy, C3-6 cycloalkyl, Cm alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or Cm alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
15 CF 3 , CCb, Me, N 0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH3, OCF 3 , SMe, 

COOR 9 , SO3R 7 , S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, 
H, halogens. Cm alkoxy, C3.6 cycloalkyl. Cm alkyl, and aryl wherein each of the 
C3.6 cycloalkyl, Cm alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCI3, Me, 
20 N 0 2 , OH, OMe, OEt, CONR*R 9 , NR 8 R 9 , NHCOCH3, OCF 3 , SMe, COOR 7 . 

S 0 2 NR 8 R 9 , SO3R 7 , COMe, COEt, CO-lower alkyl, SCF3, CN, Cm alkenyl, H. 
halogens. Cm alkoxy, C3.6 cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5 , 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
25 selected from O, N or S and said cyclic structure may be optionally substituted by up 

to four substituents in any position independently selected from CF3, CCI3, Me. 
N 0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S 0 2 NR 8 R 9 , SO3R 7 , NHCOCH3, COEt. 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

30 R 8 and R 9 are independently a H, or Cm alkyl, or C 2 .6 alkenyl, or 

cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
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halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
S0 3 R 7 , COEt, NHCOCH 3 , or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
5 heterocyclic ring or a polycycle; 

15. A method for modulating by inverse agonism the activity of a human 5HT 2A 
serotonin receptor by contacting the receptor with a compound of formula: 



(B) 


Wherein: 

10 W is Me, or Et, or halogen; 

X is either Oxygen or Sulfur; 

Y is NR 2 R\ or (CH 2 ) m R 4 , or 0(CH 2 ) n R 4 ; 

Z is lower alkyl (Cm); 
m = 0 - 4 

15 n =0-4 

R 1 is H or lower alkyl (Cm); 

R 2 is H or lower alkyl(CM); 

R 3 and R 4 are independently a Cm alkyl, or Cm alkenyl, or cycloalkyl, or 
aryl group and each said group may be optionally substituted by up to four 
20 substituents in any position independently selected from CF 3 , CC1 3 , Me, N0 2 , OH. 

OMe, OEt CONR 5 R 6 , NR s R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe. 
COEt, CO-lower alkyl, SCF 3 CN, Cm alkenyl, H, halogens. Cm alkoxv. Cm 
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cycloalkyl, Cm alkyl, aryl, and aryloxy wherein each of the cycloalkyl, Cm 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
substituents in any position independently selected from CF3, CCI3, Me, NO2, OH, 
OMe, OEt, CONR 5 R 6 , NR 5 R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , 
5 S02NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF3, CN, C2.6 alkenyl, H, halogens, C|. 

4 alkoxy, C3^ cycloalkyl, C i_6 alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or C24 alkenyl, or 
cycloalkyl, or aryl, or CH2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
10 CF 3 , CC 1 3 , Me, N 0 2 , OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH3, OCF 3 , SMe, 

COOR 9 , SO3R 7 , S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 * alkenyl, 
H, halogens. Cm alkoxy, C3-6 cycloalkyl. Cm alkyl, and aryl wherein each of the 
C3.6 cycloalkyl. Cm alkyl, or.aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCI3, Me, 
15 N 0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH3, OCF 3 , SMe, COOR 7 , 

S02NR 8 R 9 , SO3R 7 , COMe, COEt, CO-lower alkyl, SCF3, CN, C2.6 alkenyl, H, 
halogens. Cm alkoxy, C3-6 cycloalkyl, Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
20 selected from O, N or S and said cyclic structure may be optionally substituted by up 

to four substituents in any position independently selected from CF3, CCI3, Me. 
NO2, OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 . S0 3 R 7 , NHCOCH3, COEt. 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

25 R 8 and R 9 are independently a H, or Cm alkyl, or C2.6 alkenyl, or 

cycloalkyl, or aryl, or CH2aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF 3 , OEt, CC 1 3 , Me, N 0 2 , OH, OMe, SMe, COMe, CN, COOR 7 . 
SO3R 7 , COEt, NHCOCH3, or aryl; 

30 an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 

to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 
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16 . A method for modulating by inverse agonism the activity of a human 5 HT 2A 
serotonin receptor by contacting the receptor with a compound of formula: 

2 




(I) 


Wherein: 

5 Preferably R 1 and R 2 are H. 

Preferably W is Br. 

Preferably X is O. 

Preferably Z is Me. 

R 3 is Ci-6 alkyl, or C 2 -6 alkenyl, or cycloalkyl, or aryl group and each said group 
10 may be optionally substituted by up to four substituents in any position 

independently selected from CF3, CCI3, Me, NO2, OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , OCF3, SMe, COOR 7 , S 0 2 NR 5 R 6 , SO3R 7 , COMe, COEt, CO-lower alkyl. 
SCF3CN, C 2 ^ alkenyl, H, halogens, C M alkoxy, cycloalkyl, C,. 6 alkyl, arvl, and 
aryloxy wherein each of the C3-6 cycloalkyl, C |. 6 alkyl, aryl, or aryloxy groups may 
15 be further optionally substituted by up to four substituents in any position 

independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR 5 R 6 , 
NR 5 R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , S 0 2 NR 5 R 6 , COMe, COEt, CO- 
lower alkyl, SCF 3 , CN, C 2 -a alkenyl, H, halogens, C M alkoxy, C 3 . 6 cycloalkvl, C |. 6 
alkyl, and aryl; 

20 R 5 and R 6 are independently a H, or Ci* alkyl, or C 2 . 6 alkenyl, or 

cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR 7 R*. NR 7 R 8 , NHCOCH3, OCF 3 . SMe. 
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COOR 9 , SO3R 7 , S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF3, CN, C 2 . 6 alkenyl, 
H, halogens, C M alkoxy, cycloalkyl, Cm alkyl, and aryl wherein each of the 
C 3 ^ cycloalkyl, Cm alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CCI3, Me. 

5 N 0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH3, OCF 3 , SMe, COOR 7 , 

S 0 2 NR 8 R 9 , SO3R 7 , COMe, COEt, CO-lower alkyl, SCF3, CN, C 2 .6 alkenyl, H, 
halogens. Cm alkoxy, C3.6 cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 

10 selected from O, N or S and said cyclic structure may be optionally substituted by up 

to four substituents in any position independently selected from CF 3 , CC1 3 , Me. 
N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 , SO 3 R 7 , NHCOCH 3 , COEt, 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

>5 R and R are independently a H, or Cm alkyl, or Cm alkenyl, or 

cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
SO 3 R 7 , COEt, NHCOCH 3 , or aryl; 

20 an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 

to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 

17. A method for modulating by inverse agonism the activity of a human 5HT2A 
serotonin receptor by contacting the receptor with a compound of formula: 
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(H) 

Wherein: 

Preferably R 1 is H. 

Preferably W is Br. 

5 Preferably X is 0 . 

Preferably Z is Me. 
n = 0 - 4 

R 4 is Cm alkyl, or C2-6 alkenyl, or cycloalkyl, or aryl group and each said 
group may be optionally substituted by up to four substituents in any position 
10 independently selected from CF3, CCI3, Me, N02, OH, OMe, OEt, CONR'R 6 , 

NR 5 R 6 , OCF 3 , SMe, COOR 7 , S 0 2 NR 5 R 6 , SO3R 7 , COMe, COEt, CO-lower alkyl. 
SCF3CN, C24 alkenyl, H, halogens. Cm alkoxy, C3.6 cycloalkyl. Cm alkyl, aryl, and 
aryloxy wherein each of the C3-6 cycloalkyl. Cm alkyl, aryl, or aryloxy groups may 
be further optionally substituted by up to four substituents in any position 
15 independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe. OEt. CONR 5 R 6 . 

NR 5 R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , S 0 2 NR 5 R 6 , COMe, COEt. CO- 
lower alkyl, SCF3, CN, C 2 ^ alkenyl, H. halogens. Cm alkoxy, C3.6 cycloalkyl. Cm 
alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or Cm alkenyl, or 
20 cycloalkyl, or aryl, or CH2 aryl group and each said group may be optionally 

substituted by up to four substituents in any position independently selected from 
CF 3 , CCI3, Me. NO2, Oil, OMe, OEt, CONR 7 R®, NR 7 R S , NHCOCH3, OCF3. SMe. 
COOR 9 . SO3R 7 , S 0 2 NR 7 R 8 , COMe. COEt. CO-lower alkyl. SCF 3 . CN. C ; . 6 alkenyl. 
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H. halogens, Ci-i alkoxy, cycloalkyl, Cm alkyl, and aryl wherein each of the 
C 3.6 cycloalkyl, Cm alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF 3 , CC1 3 , Me. 
N0 2 , OH, OMe, OEt, CONR 8 R 9 , NR*R 9 , NHCOCH 3 , OCF 3> SMe, COOR 7 . 
5 S0 2 NR 8 R 9 , SO 3 R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl. H. 

halogens. Cm alkoxy, C 3 ^ cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
10 to four substituents in any position independently selected from CF 3 , CC1 3 , Me, 

N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 SO 3 R 7 , NHCOCH 3 , COEt, 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

R 8 and R 9 are independently a H, or Cm alkyl, or Cm alkenyl, or 
15 cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 

substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CCI 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
SO 3 R 7 , COEt, NHCOCH 3 , or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
20 to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 

18. A method for modulating by inverse agonism the activity of a human 5HT2A 
serotonin receptor by contacting the receptor with a compound of formula: 
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m 

Wherein: 

Preferably W is Br. 

Preferably X is O. 

5 Preferably Z is Me. 

Preferably R 1 is H. 
m = 0 - 4 

R 4 is Ci-6 alkyl, or C 2 -6 alkenyl, or cycloalkyl, or aryl group and each said 
group may be optionally substituted by up to four substituents in any position 
10 independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR'R 6 , 

NR 5 R 6 , OCF3, SMe, COOR 7 , S 0 2 NR 5 R 6 , SO3R 7 , COMe, COEt, CO-lower alkyl. 
SCF3CN, Cm alkenyl, H, halogens, Cm alkoxy, C3-6 cycloalkyl, C m alkyl, aryl, and 
aryloxy wherein each of the C3-6 cycloalkyl, Cm alkyl, aryl, or aryloxv groups may- 
be further optionally substituted by up to four substituents in any position 
15 independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR 5 R 6 , 

NR 5 R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , S 0 2 NR 5 R 6 , COMe, COEt, CO- 
lower alkyl, SCF3, CN, Cm alkenyl, H, halogens, Cm alkoxy, C 3 ^ cycloalkyl, Ci- 6 
alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or Cm alkenyl, or 
20 cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally- 

substituted by up to four substituents in any position independently selected from 
CF 3 , CCI3, Me. N 0 2l OH, OMe, OEt, CONR 7 R 8 , NR 7 R 8 , NHCOCH3. OCF 3 . SMe. 
COOR 9 . SO3R 7 . S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN. C 2 . 6 alkenyl. 
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H, halogens, l'm alkoxy, C 3 . 6 cycloalkyl, Cm alkyl, and aryl wherein each of the 
Cm cycloalkyl, Cm alkyl, or_aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF 3 , CC1 3 , Me. 
N0 2 , OH, OMe, OEt, CONR 8 R 9 , NR g R 9 , NHCOCH 3 , OCF 3 , SMe, COOR 7 , 
5 S0 2 NR*R 9 , S0 3 R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl. H. 

halogens. Cm alkoxy, C 3 . 6 cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from 0, N or S and said cyclic structure may be optionally substituted by up 
10 to four substituents in any position independently selected from CF 3 , CC1 3 , Me, 

N0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S0 2 NR 8 R 9 S0 3 R 7 , NHCOCH 3 , COEt, 
COMe, or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

R 8 and R 9 are independently a H, or Cm alkyl, or C 2 .6 alkenyl, or 
15 cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 

substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CC1 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 , 
SOjR 7 , COEt, NHCOCH 3 , or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
20 to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle. 
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19. A compound of formula (C) 
R 1 



(C) 

Wherein: 

5 W is Me, or Et, or halogen; 

X is either Oxygen or Sulfur; 

Y is NR 2 R\ or (CH 2 ) m R 4 , or 0(CH 2 ) n R 4 ; 

Z is lower alkyl (Ci^); 
m = 0 - 4; 

10 n =0-4; 

R 1 is H or lower alkyl 
R 2 is H or lower aikyl(Ci-j); 

R 3 is a Ci^ alkyl, or C 2 ^ alkenyl, or cycloalkyl, or (CH 2 ) k aryl group (k = 1 - 
4), preferably k = 1, and each said group may be optionally substituted by up to four 
15 substituents in any position independently selected from CF3, CCI3, Me, N0 2 , OH. 

OMe, OEt, CONR s R 6 , NR 5 R 6 , OCF 3 , SMe, COOR 7 , S0 2 NR 5 R 6 , S0 3 R 7 , COMe. 
COEt, CO-lower alkyl, SCF 3 CN, C 2 -6 alkenyl, H, halogens, Cm alkoxv, C 3 .6 
cycloalkyl, Ci -6 alkyl, aryl, and aryloxy wherein each of the C 3.6 cycloalkyl, C |. 6 
alkyl, aryl, or aryloxy groups may be further optionally substituted by up to four 
20 substituents in any position independently selected from CF 3 , CCI 3 . Me. NO;. OH. 

OMe. OEt, CONR 5 R 6 , NR 5 R 6 , NHCOCH 3 , OCF3, SMe, COOR 7 . SO?R 7 . 
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SO:NR 5 R 6 , COMe, COEt, CO-lower alkyl, SCF3, CN, C 2 . 6 alkenyl, H, halogens. C|. 

4 alkoxy, C 3 . 6 cycloalkyl, Cm alkyl, and aryl; 

R 4 is a C1-6 alkyl, or C 2 . 6 alkenyl, or cycloalkyl, or aryl group and each said 
group may be optionally substituted by up to four substituents in any position 
independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR'R , 
NR 5 R 6 , OCF 3 , SMe, COOR 7 , S 0 2 NR 5 R 6 , SO3R 7 , COMe, COEt, CO-lower alkyl, 
SCF3CN, Cm alkenyl, H, halogens, C M alkoxy, C 3 ^ cycloalkyl, Cm alkyl, aryl, and 
aryloxy wherein each of the C 3 -6 cycloalkyl, Cm alkyl, aryl, or aryloxy groups may 
be further optionally substituted by up to four substituents in any position 
independently selected from CF3, CCI3, Me, N 0 2 , OH, OMe, OEt, CONR'R , 
NR 5 R 6 , NHCOCH3, OCF 3 , SMe, COOR 7 , SO3R 7 , S 0 2 NR 5 R 6 , COMe, COEt, CO- 
lower alkyl, SCF 3 , CN, C 2 -* alkenyl, H, halogens, Cm alkoxy, C 3 ^ cycloalkyl, Cm 
alkyl, and aryl; 

R 5 and R 6 are independently a H, or Cm alkyl, or C 2 . 6 alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
substituted by up to four substituents in any position independently selected from 
CF 3 , CCI3, Me, N 0 2j OH, OMe, OEt, CONR 7 R g , NR 7 R 8 , NHCOCH3, OCF 3 , SMe. 
COOR 9 , S 0 3 R 7 , S 0 2 NR 7 R 8 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, C 2 . 6 alkenyl, 
H, halogens, C M alkoxy, C3.6 cycloalkyl, Cu 6 alkyl, and aryl wherein each of the 
C3.6 cycloalkyl, Cm alkyl, or.aryl groups may be further optionally substituted by 
up to four substituents in any position independently selected from CF3, CC 1 3 , Me. 
N 0 2 , OH, OMe, OEt, CONR 8 R 9 , NR 8 R 9 , NHCOCH3, OCF 3 . SMe. COOR 7 . 
S02NR 8 R 9 , SO3R 7 , COMe, COEt, CO-lower alkyl, SCF 3 , CN, Cm alkenyl. H. 
halogens. Cm alkoxy, C3-6 cycloalkyl. Cm alkyl, and aryl, 

or R 5 and R 6 may form part of a 5, 6 or 7 membered cyclic structure which 
may be either saturated or unsaturated and that may contain up to four heteroatoms 
selected from O, N or S and said cyclic structure may be optionally substituted by up 
to four substituents in any position independently selected from CF3, CCI3. Me. 
N 0 2 , OH, OMe, OEt, OCF 3 , SMe, COOR 7 , S 0 2 NR 8 R 9 , SO3R 7 , NHCOCH5, COEt. 
COMe. or halogen; 

R 7 may be independently selected from H or Cm alkyl; 

R 8 and R 9 are independently a H. or Cm alkyl, or C 2 .6 alkenyl, or 
cycloalkyl, or aryl, or CH 2 aryl group and each said group may be optionally 
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substituted by up to four substituents in any position independently selected from 
halogen, CF 3 , OCF3, OEt, CCI 3 , Me, N0 2 , OH, OMe, SMe, COMe, CN, COOR 7 . 
SO 3 R 7 , COEt, NHCOCH 3 , or aryl; 

an aryl moiety can be a 5 or 6 membered aromatic heterocyclic ring (containing up 
5 to 4 hetero atoms independently selected from N, O, or S) or a 6 membered aromatic non- 
heterocyclic ring or a polycycle; 

with the proviso that said compound is not: 

N-[3-(4-bromo- 1 -methylpyrazoi-3-yI)phenyl] [methylaminojcarboxamide, or 
N-[3-(4-bromo- 1 -methylpyrazol- 3 -yl)phenyl][ {(4-trifluoromethoxy)phenyl } amino] 

10 carboxamide, or 

N-[3<4-bromo-l-methylpyrazol-3-yl)phenyl][2-chlorophenyl]carboxamide, or 
N-[3-(4-bromo- 1 -methylpyrazol-3-yl)phenyl] [2-chloro-3-pyridyl]carboxamide, or 
N-[3-(4-bromo-l-methylpyrazoI-3-yl)phenyl][trichloromethyl]carboxamide. 

20. The use of a compound of claim 19 for the manufacture of a medicament. 

15 
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FIGURE 3A 

ATGGATATTCTTTGTGAAGAAAATACn Cl 1 1GAGCTCAACTACGAACTCCCTAATGCAATTA 

AATGATGACAACAGGCTCTACAGTAATGACTTTAACTCCGGAGAAGCTAACACTTCTGATGCA 

TTTAACTGGACAGTCGACTCTGAAAATCGAACCAACCTTTCCTGTGAAGGGTGCCTCTCACCG 

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCnTACTGACAGCCGTAGTGA 

TTATTCTAACTATTGCrGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC 

AGAATGCCACCAACTATTTCCTGATGTCACTTGCCATAGCTGATATGCTGCTGGGTTTCCTTGT 

CATGCCCGTGTCCATGTTAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCTTTGT 

GCAGTCTGGATTTACCTGGACGTGCTCTTCTCCACGGCCTCCATCATGCACCTCTGCGCCATCT 

CGCTGGACCGCTACGTCGCCATCCAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA 

AGGCATTTCTGAAAATCATTGCrGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG 

TCnTGGGCTAGAGGACGATTCGAAGGTCnTAAGGAGGGGAGTTGCTTACTCGCCGATGATA 

ACTTTGTCCTGATCGGC 1 C 1 1 1 1 G 1 G JCATI T lTCA TTCCCTTAACCATCATGGTGATCACCrAC 

TTTCTAACTATCAAGTCACTCCAGAAAGAAG€TACTTTGTGTGTAAGTGATCTTGGCACACGG 

GCCAAATTAGCTTCnTCAGCTrCCTCCCTCAGAGTTCmGTCTTCAGAAAAGCTCTTCCAGC 

GGTCGATCCATAGGGAGCCAGGGTCCrACACAGGCAGGAGGACTATGCAGTCCATCAGCAAT 

GAGCAAAAGGCATGCAAGGTGCrGGGCATCGTCTTCITCCTGnTGTGGTGATGTGGTGCCCT 

TTCTTCATCACAAACATCATGGCCGTCATCTGCAAAGAGTCCTGCAATGAGGATGTCATTGGG 




CACTGTTCAACAAGACCTATAGGTGAGC Cn II CA CGGTATATTCAGTGTCAGTACAAGGAAA 
ACAAAAAACCATTGCAGTTAATTTTAGTGAACACAATACCGGCTn GGCCTACAAGTCTAGCC 
AACTTCAAATGGGACAAAAAAAGAATTCAAAGCAAGATGCCAAGACAACAGATAATGACTGC 
TCAATGGTTGCTCTAGGAAAGCAGIATTCTGAAGAGGCrTCTAAAGACAATAGCGACGGAGT 
GAATGAAAAGGTGAGCTGTGTGTGA 
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FIGURE 3B 

MDn.CEENTSLSSTTNSLMQLM3DCfRLYSNDFNSGEANTSDAFNWTVDSHNRTNLSCEGCLSPSCL 

SIIin.QEKNWSAIiTAVVIILTIAGNILVIMAVSLEKXLQNATNYFLMSLAIADMLLGFLVMPVSM 

lulygyrwplpsklcavwiyldvlfstasimhlcaisudryvaiqnpihhsrfnsrtkaflkiiavw 

TISVGISMFIPVFGLQDDSKVFKEGSCLLADDNFVLIGSFVSFnPLTIMVITYFLTIKSLQKEAILCVS 

dlgtrakij^shjqssi^sekijqrsihrepgsytgrrtmqsis^qkacb^givfflfvvmwc 

PFFITNIMAVICKESCNn)VIGALLNVFVWIGYLSSAVNPLVYTLFNKTYRSAFSRYIQCQYKENKK 

plqulvntipalaykssqlqmgqkknskqdakttdndcsmvalgkqyseeaskdnsdgynekv 

scv 


FIGURE 4B 

MVNLRNAVHS FLVHLIGLLVWQCD ISVS PVRATVTDIFNTSDGGRFKFPDGVQNWPALS IVI 1 1 IMTIGG2J 

ILVIMAVSMEKIttiHNMinfFLMSLAXADMLVGLLVMPIiSIiLAIL'XDYVWPIiPRYIiCPVWISLDVLFSTAS I 

MEjCAISLDRYVAIHOTIEESRFNSRTKAXMKIAIVVIAISIGVSVPIPVIGUIDEEKVTVNIITTCVLIIDPN 

FVLIGSFVAFFIPLTIMVTT£CLTIYVLRRQALMLLHGHTEEPPGIiSLDFLKCCKRNTAEEENSAITPNQDQ 

NARRRKK2CERRPRGTMQA2iniERKASKVMIVFFV7LIMWCPF5'ITIIIXiSVLCEKSCNQKLMEKIiLNVFVW 

IGYVCSGIOTLVYTLFNKIYRRAFSirmiCNYICVEKKPFVRQIPRVAATALSGREIJIVNXYRHTNEPVTEK 

ASDHEPGIEMQVEHLELPVNPSSWSERISSV 
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FIGURE 4A 

ATGGTGAACCTGAGGAATGCGGTGCATTCATTCCTTGTGCACCTAATTGGCCTATTGGnTGGC 

aatgtgatatttctgtgagcccagtagcagctatagtaactgacattttcaatacctccgatg 

GTGGACGCTTCAAATTCCCAGACGGGGTACAAAACTGGCCAGCACnTCAATCGTCATGATAA 

TAATCATGACAATAGGTGGCAACATCCTTGTGATCATGGCAGTAAGCATGGAAAAGAAACTG 

CACAATGCCACCAATTACTTCTTAATGTCCCTAGCCATTGCTGATATGCTAGTGGGACTACTTG 

TCATGCCCCTGTCTCTCCTGGCAATCCTTTATGATTATGTCTGGCCACTACCTAGATATTTGTG 

CCCCGTCTGGATTTCTTTAGATGTTTTATTTTCAACAGCGTCCATCATGCACCTCTGCGCTATAT 

CGCTGGATCGGTATGTAGCAATACGTAATCCrATTGAGCATAGCCGTTTCAATTCGCGGACTA 

AGGCCATCATGAAGATTGCTAi iui 1 IGGGCAAl i i IATAGGTGTATCAu i iCCTATCCCTGT 

GATTGGACTGAGGGACGAAGAAAAGGTu i'i CGTGAACAACACGACGTGCGTGCTCAACGACC 
CAAATTTC G ' nLTrA TTGGGTCCTTCGTAGCl i rCTTCATACCGCTGACGATTATGGTGATTAC 

GTATTGCCTGACCATCTACGTTCTGCGCCGACAAGCrTTGATGTTACTGCACGGCGACACCGA 

GGAACCGCCTGGACTAAGTCTGGATTTCCTGAAGTGCTGCAAGAGGAATACGGCCGAGGAAG 

AGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAAGAAAGAAGAAGGAGAGACG 

TCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTTCGAAAGTCCTTGGGATTG 

TTTTCITTGTGTrTCTGATCATGTGGTGCCCATnTTCATTACCAATATTCTGTCTGTTCTTTGTG 

AGAAGTCCTGTAACCAAAAGCrCATGGAAAAGCITCTGAATGTu i i lux i i GGATTGGCTATG 

TTTGTTGAGGAATCAATCCTCTGGTGTATACTCTGTTCAACAAAATTTACCGAAGGGCATTCTC 

CAACTATTTGCGTTGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAGGCAGATTCGAAGAGT 

TGCCGCCACTGCTTTGTCTGGGAGGGAGCTTAATGTrAACATTTATCGGCATACGAATGAACC 

GGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAGATGCAAGTTGAGAATTTAGAGT 

TACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGATTAGCAGTGTGTGA 
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FIGURE SA 

atggtgaacctgaggaatgcggtgcattcattccttgtgcacctaattggcctattggtttggcaat 

GTGATATTTCTGTGAGCCCAGTAGCAGCTATAGTAACTGACATTTTCAATACCTCCGATGGTGGACG 

CrTCAAATTCCCAGACGGGGTACAAAACTGGCCAGCACmCAATCGTCATCATAATAATCATGAC 

AATAGGTGGCAACATCCTTGTGATCATGGCAGTAAGCATGGAAAAGAAACTGCACAATGCCACCA 

ATTACTTCTTAATGTCCCTAGCCATTGCTGATATGCTAGTGGGACTACTTGTCATGCCCCTGTCTCTC 

CTGGCAATCCTTTATGAnATGTCTGGCCACTACCrAGATATTTGTGCCCCGTCTGGATTTCnTAGA 

TGTTTTATTTTCAACAGCGTCCATCATGCACCTCTGCGCTATATCGCTGGATCGGTATGTAGCAATA 

CGTAATCCTATTGAGCATAGCCGTTTCAATTCGCGGACTAAGGCCATCATGAAGATTGCTATTGTTT 

GGGCAATTTCTATAGGTGTATCAGTTCCTATCCCTGTGATTGGACTGAGGGACGAAGAAAAGGTGT 

TCGTGAACAAGACGACGTGCGTGCTCAACGACCCAAATTrCGTTCTTATTGGGTCCTTCGTAGCm 

CTTCATACCGCTGACGATTATGGTGATTACGTATTGCCTGACCATCTACGTTCTGCGCCGACAAGCT 

TTGATGTTACTGCACGGCCACACCGAGGAACCGCCrGGACTAAGTCTGGATnCCTGAAGTGCTGC 

AAGAGGAATACGGCCGAGGAAGAGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAA 

GAAAGAAGAAGGAGAGACGTCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAA 



GATIGGCTATGTTTGTTCAGGAATCAATCCTCTGGTGTATACTCTGTrCAACAAAATTTACCGAAGG 
GCATrCTCCAACTATTTGCGTrGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAGGCAGATTCCA 
AGAGTTGCCGCCAC TOCl IT GI C I GGGAGGGAGCTTAATGTTAACATTTATCGGCATACCAATGAA 
CCGGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAGATGCAAGTTGAGAAnTAGAGTT 
ACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGATTAGCAGTGTGTGA 
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FIGURE SB 

MVNUlNAVHSFLVHLIGLLVWQCDISVSPVAAT/TDIFNTSDGGRfKFPDGVQNWPALSrVIinMTI 

GGNn-VlMAVSMEKKLHNATNYFLMSLAIADMLVGLLVMPLSLLAILYDYVWPLPRYLCTVWISL 

DVLFSTASIMHLCAISLDRYVAIRNPIEHSRFNSR'TKAIMKIArVWAISIGVSVPIPVIGLRDEEKVFV 

NNTrOTNDPNFVLIGSFVAFFIPLTIMVITyCLTTyVLRR.QALMLLHGHTEEPPGLSLDFLKCQECRN 

TAEEENSANPNQDQNARRRKKKERRPRGTMQAINNERKAIPwVLGIVPFVTIJMWCPFFITNILSVL 

C^KSCNQFXMEKLIJfVTVWIGYVCSGINPLVYIIJNKryRRAFSNYIJia^YKVEKKPPVRQIPRV 

AATALSGRELNVNIYRHTNEPVIEKASDNEPGIEMQVENLELPV^IPSSWSERISSV. 

FIGURE 6B 

MDILCEHNTSLSSTTNSLMQLNDDNRLySNDFNSGEANTSDAFNWTVDSENRTNLSCEGCLSPSCL 

SLLHLQEKNWSALLTAWIILHAGNILVIMAVSLEKKLQNATNYFLMSLAIADMLLGFLVMPVSM 

LTILYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCAISLDRYVAIQNPIHHSRFNSRTKAFLKIIAVW 

TISVGISMPIPVFGLQDDSKVFKEGSCLLADDNFVIIGSFVSFFIPL'nMVITYTLTIKVLRRQALMLL 

HGHTEEPPGLSLDFLKCCKRNTAEEENSANPNQDQNARRRKKKERRPRGTMQAINNERKAS 

KVLGIVPRJFWMW CPFFITMMA VICKESChCT VIGAIXNVFVWIGYLSS A VNPL VYTLFNKIYR 

RAFSNYLRCNYKVEKKPPVRQIPRVAATALSGIIELNVNIYRHTNEPVIEKASDNEPGIEMQVE 

NLELPVNPSSWSERISSV 


FIGURE 6C 



SUBSTITUTE SHEET (RULE 26) 



WO 99/52927 


PCT/US99/08I68 


8/19 

FIGURE 6A 

ATGGATATTCTTTGTGAAGAAAATACTTCTTTGAGCTCAACTACGAACrCCCTAATGCAATTA 

AATGATGACAACAGGCrCTACAGTAATGACTTTAACrCCGGAGAAGCTAACACTTCTGATGCA 

nTAACTGGACAGTCGACTCTGAAAATCGAACCAACCTTTCCTGTGAAGGGTGCCTCTCACCG 

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCTTTACTGACAGCCGTAGTGA 

TTATTCTAACTATTGCTGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC 

AGAATGCCACCAACTAnTCCTGATGTCACTTGCCATAGCTGATATGCTGCTGGGTTTCCTTGT 

CATGCCCGTGTCCATGTTAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCnTGT 

GCAGTCTGGATTTACCTGGACGTGCrCTTCTCCACGGCCrCCATCATGCACCTCTGCGCCATCT 

CGCTGGACCGCTACGTCGCCATCGAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA 

AGGCATTTCTGAAAATCATTGCTGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG 

TCTTTGGGCTACAGGACGATTCGAAGGTCTTTAAGGAGGGGAGTTGCTTACTCGCCGATGATA 

ACTTTGTCCTGATCGGCTCTnTGTGTCATTnTCATTCCCTTAACCATGATGGTGATCACCTAC 

TTTCTAACTATCAAGGTTCTGCGCCGACAAGCTTTGATGTTACTGCACGGCCACACCGAG 

GAACCGCCTGGACTAAGTCTGGATTTCCTGAAGTGCTGCAAGAGGAATACGGCCGAGGA 

AGAGAACTCTGCAAACCCTAACCAAGACCAGAACGCACGCCGAAGAAAGAAGAAGGAG 

AGACGTCCTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTTCGAAGGTACT 

GGGCATCGTCTTCTTCCTGTTTGTGGTGATGTGGTGCCCTTTCTTCATCACAAACATCATGGCC 

GTCATCTGGAAAGAGTCCTGCAATGAGGATGTCATTGGGGCCCTGCTCAATGTGTTTGTTTGG 

ATCGGTTATCTCTCTTCAGCAGTCAACCCACTAGTCTATACTCTGTTCAACAAAATTTACCGA 

AGGGCATTCTCCAACTATTTGCGTTGCAATTATAAGGTAGAGAAAAAGCCTCCTGTCAG 

GCAGATTCCAAGAGTTGCCGCCACTGCTTTGTCTGGGAGGGAGCTTAATGTTAACATTT 

ATCGGCATACCAATGAACCGGTGATCGAGAAAGCCAGTGACAATGAGCCCGGTATAGAG 

ATGCAAGTTGAGAATTTAGAGTTACCAGTAAATCCCTCCAGTGTGGTTAGCGAAAGGAT 

TAGCAGTGTGTGA 
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FIGURE 7A 


ATGGATATTCTTTGTGAAGAAAATAC OCT l T GAGCT CAACTACGAACTCCCTAATGCAATTA 

AATGATGACAACAGGCTCTACAGTAATGACnTAACTCCGGAGAAGCTAACACTTCIGATGCA 

nTAACTGGACAGTCGACTCTGAAAATCGAACCAACCmCCTGTGAAGGGTGCCTCTCACCG 

TCGTGTCTCTCCTTACTTCATCTCCAGGAAAAAAACTGGTCTGCTTTACTGACAGCCGTAGTGA 

TTATTCTAACTATTGCTGGAAACATACTCGTCATCATGGCAGTGTCCCTAGAGAAAAAGCTGC 

AGAATGCCACCAACTATTTCCTGATGTCACTTGCCATAGCTGATATGCTGCTGGGTn'CCTTGT 

CATGCCCGTGTCCATGTTAACCATCCTGTATGGGTACCGGTGGCCTCTGCCGAGCAAGCTnGT 

GCAGTCrGGATTTACCTGGACGTGCTCTTCTCCACGGCCTCCATCATGCACCrCTGCGCCATCT 

CGCTGGACCGCTACGTCGCCATCCAGAATCCCATCCACCACAGCCGCTTCAACTCCAGAACTA 

AGGCATTTCTGAAAATCATTGCTGTTTGGACCATATCAGTAGGTATATCCATGCCAATACCAG 

TCTTTGGGCTACAGGACGATTCGAAGGTCTTTAAGGAGGGGAGTTGCTTACTCGCCGATGATA 


attgcctgaccatctacgttctgcgccgac aagctttgatgttactgcacggccacacc 

gaggaaccggctggactaagtctggatttg ctgaagtgctgcaagaggaatacggccga 

GGAAGAGAACTCTGCAAACCCTAACCAAGAC CAGAACGCACGC-CGAAGAAAGAAGAAG 

gagagacgtcctaggggcaccatggagggtatcaacaatgaaagaaaagctaagaaagt 


TGGCCGTCATCTGCAAAGAGTCCTGCAATGAGGATGTCATTGGGGCCCTGCTCAATGTGmG 

TTTnflATG nr iTT ATrT m -TT r ACyAfTrrAArGGACTrAGTCT ATACTCTGTTCAAC A AAATTT 

ACCGAAGGGCATTCTCCAACTATTTGCGT TGCAATTATAAGGTAGAGAAAAAGCCTCCT 

GTCAGGCAGA TTCCAAGAGTTGCCGCCACTGCnTGTCTGGGAGGGAGCTTAATGmA 

CAnTATCGG GATACCAATGAACCGGTGATCGAGAAAGCCAGTGACAATGAGCCCGGIA 

TAGAGATGCAAGTTGAGAA TTTAGAGTTACCAGTAAATCCCTCCAGTGTGGTTAGCGAA 

AGGATTASCAOTGESIfiA 
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FIGURE 7B 

MDrLCEENTSLSSTTNSLMQLNDDCfRLYSNDFNSGEANTSDAFNWTVDSENRTNLSCEGCLSPSCL 
SLLHLQEKNWSALLTAWnLTlAGNILVIMAVSLEiQKLQNATNYFLMSLAIADMLLGFLYMPVSM 
LHLYGYRWPLPSKLCAVWIYLDVLFSTAS1MHLCAISLDRYVAIQNPIHHSRFNSIITKAFLKIIAVW 
TISVGISMPIPVFGLODDSKVFKEGSCLLADDNFVLIGSFVSFFIPLTIMVrTY CLTrWLRRf) ALML 

lhghteeppg ^sldflkcckrntaeeensanpnoponar rrkkkerrprgtmoainnerka 

ATKVLCTVFFVFLIMWCP FFinflMAVICKESCNED VIGALLNVFVWIGYLSSAVNPT .VY TT FNICIY 
RRAFSNYLRC NYKVEKKPPVROIPRVAATALSGRELNVNIYRHTNEPVTI SKASPWEPGrEMOV 
ENLELPVNPSS WSERISSV 
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5-HT2A 


FIGURE 12 
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SEQUENCE LISTING 


(1) GENERAL INFORMATION: 

(i) APPLICANTS: Arena Pharmaceuticals, Inc. and Tripos, Inc. 


(ii) TITLE OF INVENTION: Non-Endogenous , Constitutively Activated 

Human Serotonin Receptors and Small Molecule Modulators Thereof 

(iii ) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Woodcock Washburn Kurtz Mackiewicz & 

Norris LLP 

(B) STREET: One Liberty Place - 46th Floor 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19103 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: WINDOWS NT, Version #4.0 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US99/08168 

(B) FILING DATE: April 14, 1999 

(C) CLASSIFICATION: 435 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Mark J. Rosen 

(B) REGISTRATION NUMBER: 39,822 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (215) 568-3100 

(B) TELEFAX: (215) 568-3439 


(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GACCTCGAGG TTGCTTAAGA CTGAAGC 27 

(3) INFORMATION FOR SEQ ID NO: 2: 
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( i I SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATTTCTAGAC ATATGTAGCT TGTACCG 27 

(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTAGGGGCAC CATGCAGGCT ATCAACAATG AAAGAAAAGC TAAGAAAGTC 50 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CAAGGACTTT CTTAGCTTTT CTTTCATTGT TGATAGCCTG CATGGTGCCC 50 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GACCTCGAGT CCTTCTACAC CTCATC 26 

(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30 

(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CAAAGAAAGT ACTGGGCATC GTCTTCTTCC T 31 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCGCTCGAGT ACTGCGCCGA CAAGCTTTGA T 31 

(10) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 


(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGATGCCCAG CACTTTCGAA GCTTTTCTTT CATTGTTG 
(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AAAAGCTTCG AAAGTGCTGG GCATCGTCTT CTTCCT 

(12) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TGCTCTAGAT T C CAGATAGG T GAAAACTT G 


(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGTGTCTCTC CTTACTTCA 

(14) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TCGGCGCAGT ACTTTGATAG TTAGAAAGTA GGTGAT 36 

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTCTAACTAT CAAAGTACTG CGCCGACAAG CTTTGATG 38 

(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTCAGCAGTC AACCCACTAG TCTATACTCT GTTCAACAAA ATT 43 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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ATTTCTAGAC ATATGTAGCT TGTACCGT 

(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
ATCACCTACT TTCTAACTA 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCATAATCGT CAGGGGAATG AAAAATGACA CAA 

(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATTTTTCATT CCCCTGACGA TTATGGTGAT TAC 

(21) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TGATGAAGAA AGGGCACCAC ATGATCAGAA ACA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GATCATGTGG TGCCCTTTCT TCATCACAAA CAT 

(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GAGACATATT ATCTGCCACG GAGG 

(24) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
TTGGCATAGA AACCGGACCC AAGG 

(25) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D! TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 


ATGGATATTC 

TTTGTGAAGA 

AAATACTTCT 

TTGAGCTCAA 


CTACGAACTC 

CCTAATGCAA 



60 

TTAAATGATG 

ACAACAGGCT 

CTACAGTAAT 

GACTTTAACT 


CCGGAGAAGC 

TAACACTTCT 



120 

GATGCATTTA 

ACTGGACAGT 

CGACTCTGAA 

AATCGAACCA 


ACCTTTCCTG 

TGAAGGGTGC 



180 

CTCTCACCGT 

CGTGTCTCTC 

CTTACTTCAT 

CTCCAGGAAA 


AAAACTGGTC 

TGCTTTACTG 



240 

ACAGCCGTAG 

TGATTATTCT 

AACTATTGCT 

GGAAACATAC 


TCGTCATCAT 

GGCAGTGTCC 



300 

CTAGAGAAAA 

AGCTGCAGAA 

TGCCACCAAC 

TATTTCCTGA 


TGTCACTTGC 

CATAGCTGAT 



360 

ATGCTGCTGG 

GTTTCCTTGT 

CATGCCCGTG 

TCCATGTTAA 


CCATCCTGTA 

TGGGTACCGG 



420 

TGGCCTCTGC 

CGAGCAAGCT 

TTGTGCAGTC 

TGGATTTACC 


TGGACGTGCT 

CTTCTCCACG 



480 

GCCTCCATCA 

TGCACCTCTG 

CGCCATCTCG 

CTGGACCGCT 


ACGTCGCCAT 

CCAGAATCCC 



540 

ATCCACCACA 

GCCGCTTCAA 

CTCCAGAACT 

AAGGCATTTC 


TGAAAATCAT 

TGCTGTTTGG 



600 

ACCATATCAG 

TAGGTATATC 

CATGCCAATA 

CCAGTCTTTG 


GGCTACAGGA 

CGATTCGAAG 



660 

GTCTTTAAGG 

AGGGGAGTTG 

CTTACTCGCC 

GATGATAACT 


TTGTCCTGAT 

CGGCTCTTTT 



720 

GTGTCATTTT 

TCATTCCCTT 

AACCATCATG 

GTGATCACCT 


ACTTTCTAAC 

TATCAAGTCA 



780 

CTCCAGAAAG 

AAGCTACTTT 

GTGTGTAAGT 

GATCTTGGCA 


CACGGGCCAA 

ATTAGCTTCT 



840 

TTCAGCTTCC 

TCCCTCAGAG 

TTCTTTGTCT 

TCAGAAAAGC 


TCTTCCAGCG 

GTCGATCCAT 



900 

AGGGAGCCAG 

GGTCCTACAC 

AGGCAGGAGG 

ACTATGCAGT 


CCATCAGCAA 

TGAGCAAAAG 



960 

GCATGCAAGG 

TGCTGGGCAT 

CGTCTTCTTC 

CTGTTTGTGG 


TGATGTGGTG 

CCCTTTCTTC 



1020 

AT CACAAAC A 

TCATGGCCGT 

CATCTGCAAA 

GAGTCCTGCA 


ATGAGGATGT 

CATTGGGGCC 



1080 
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CTGCTCAATG 

TGTTTGTTTG 

GATCGGTTAT 

CTCTCTTCAG 


CAGTCAACCC 

ACTAGTCTAC 



1140 

ACACTGTTCA 

ACAAGACCTA 

TAGGTCAGCC 

TTTTCACGGT 


ATATTCAGTG 

TCAGTACAAG 



1200 

GAAAACAAAA 

AACCATTGCA 

GTTAATTTTA 

GTGAACACAA 


TACCGGCTTT 

GGCCTACAAG 



1260 

TCTAGCCAAC 

TTCAAATGGG 

ACAAAAAAAG 

AATTCAAAGC 


AAGATGCCAA 

GACAACAGAT 



1320 

AATGACTGCT 

CAATGGTTGC 

TCTAGGAAAG 

CAGTATTCTG 


AAGAGGCTTC 

TAAAGACAAT 



1380 

AGCGACGGAG 

TGAATGAAAA 

GGTGAGCTGT 

GTGTGA 

1416 


(26) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 


Met 

Asp 

He 

Leu 

Cys 

Glu 

Glu 

Asn 

Thr 

Ser 

Leu 

Ser 

Ser 

Thr 

Thr 

Asn 

1 



5 





10 





15 


Ser 

Leu 

Met 

Gin 

Leu 

Asn 

Asp 

Asp Asn 

Arg 

Leu 

Tyr 

Ser 

Asn 

Asp 

Phe 




20 





25 





30 



Asn 

Ser 

Gly 

Glu 

Ala 

Asn 

Thr 

Ser Asp 

Ala 

Phe 

Asn Trp 

Thr 

Val Asp 



35 





40 





45 




Ser 

Glu 

Asn 

Arg 

Thr 

Asn 

Leu 

Ser 

Cys 

Glu 

Gly 

Cys 

Leu 

Ser 

Pro 

Ser 


50 





55 





60 





Cys 

Ser 

Leu 

Leu 

His 

Leu 

Gin 

Glu 

Lys 

Asn 

Trp 

Ser 

Ala 

Leu 

Leu 

Thr 

65 





70 





75 





80 

Ala 

Val 

Val 

lie 

lie 

Leu 

Thr 

lie 

Ala 

Gly Asn 

lie 

Leu 

Val 

lie 

Met 





85 





90 





95 


Ala 

Val 

Ser 

Leu 

Glu 

Lys 

Lys 

Leu 

Gin 

Asn 

Ala 

Thr 

Asn 

Tyr 

Phe 

Leu 




100 





105 





110 



Met 

Ser 

Leu 

Ala 

lie 

Ala 

Asp 

Met 

Leu 

Leu 

Gly Phe 

Leu 

Val 

Met 

Pro 



115 





120 





125 




Val 

Ser 

Met 

Leu 

Thr 

lie 

Leu 

Tyr 

Gly 

Tyr Arg Trp 

Pro 

Leu 

Pro 

Ser 


130 





135 





140 





Lys 

Leu 

Cys 

Ala 

Val 

Trp 

lie 

Tyr 

Leu 

Asp 

Val 

Leu 

Phe 

Ser 

Thr 

Ala 

145 





150 





155 





160 
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Ser 

lie 

Met 

His 

Leu 

165 

Cys 

Ala 

Gin 

Asn 

Pro 

lie 

180 

His 

His 

Ser 

Leu 

Lys 

lie 

195 

lie 

Ala 

Val 

Trp 

He 

Pro 

210 

Val 

Phe 

Gly 

Leu 

Gin 

215 

Ser 

225 

Cys 

Leu 

Leu 

Ala 

Asp 

230 

Asp 

Ser 

Phe 

Phe 

lie 

Pro 

245 

Leu 

Thr 

lie 

Lys 

Ser 

Leu 

260 

Gin 

Lys 

Glu 

Thr 

Arg 

Ala 

275 

Lys 

Leu 

Ala 

Ser 

Ser 

Ser 

290 

Glu 

Lys 

Leu 

Phe 

Gin 

295 

Tyr 

305 

Thr 

Gly Arg 

Arg 

Thr 

310 

Met 

Cys 

Lys 

Val 

Leu 

Gly 

325 

lie 

Val 

Pro 

Phe 

Phe 

lie 

340 

Thr 

Asn 

lie 

Asn 

Glu 

Asp 

355 

Val 

lie 

Gly Ala 

Tyr 

Leu 

370 

Ser 

Ser 

Ala 

Val 

Asn 

375 

Thr 

385 

Tyr 

Arg 

Ser 

Ala 

Phe 

390 

Ser 

Asn 

Lys 

Lys 

Pro 

Leu 

405 

Gin 

Leu 

Ala 

Tyr 

Lys 

Ser 

420 

Ser 

Gin 

Leu 

Gin 

Asp 

Ala 

435 

Lys 

Thr 

Thr Asp 

Lys 

Gin 

450 

Tyr 

Ser 

Glu 

Glu 

Ala 

455 


Glu Lys Val Ser Cys Val 
465 470 

(27) INFORMATION FOR SEQ ID N0:26: 


lie 

Ser 

Leu 

Asp- Arg 

Tyr 

Val 

Ala 

He 



170 





175 


Arg 

Phe 

Asn 

Ser 

Arg 

Thr 

Lys 

Ala 

Phe 


185 





190 



Thr 

lie 

Ser 

Val 

Gly 

lie 

Ser 

Met 

Pro 

200 





205 




Asp 

Asp 

Ser 

Lys 

Val 

Phe 

Lys 

Glu 

Gly 





220 





Asn 

Phe 

Val 

Leu 

lie 

Gly 

Ser 

Phe 

Val 




235 





240 

lie 

Met 

Val 

lie 

Thr 

Tyr 

Phe 

Leu 

Thr 



250 





255 


Ala 

Thr 

Leu 

Cys 

Val 

Ser 

Asp 

Leu 

Gly 


265 





270 



Phe 

Ser 

Phe 

Leu 

Pro 

Gin 

Ser 

Ser 

Leu 

280 





285 




Arg 

Ser 

lie 

His 

Arg 

Glu 

Pro 

Gly 

Ser 





300 





Gin 

Ser 

lie 

Ser 

Asn 

Glu 

Gin 

Lys 

Ala 




315 





320 

Phe 

Phe 

Leu 

Phe 

Val 

Val 

Met 

Trp 

Cys 



330 





335 


Met 

Ala 

Val 

He 

Cys 

Lys 

Glu 

Ser 

Cys 


345 





350 



Leu 

Leu 

Asn 

Val 

Phe 

Val 

Trp 

lie 

Gly 

360 





365 




Pro 

Leu 

Val 

Tyr 

Thr 

Leu 

Phe 

Asn 

Lys 





380 





Arg Tyr 

lie 

Gin 

Cys 

Gin 

Tyr 

Lys 

Glu 




395 





400 

lie 

Leu 

Val 

Asn 

Thr 

lie 

Pro 

Ala 

Leu 



410 





415 


Gin 

Met 

Gly 

Gin 

Lys 

Lys 

Asn 

Ser 

Lys 


425 





430 



Asn Asp 

Cys 

Ser 

Met 

Val 

Ala 

Leu 

Gly 

440 





445 




Ser 

Lys 

Asp 

Asn 

Ser 

Asp 

Gly 

Val 

Asn 


460 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 


ATGGTGAACC 

TGAGGAATGC 

GGTGCATTCA 

TTCCTTGTGC 

ACCTAATTGG 

CCTATTGGTT 

60 

TGGCAATGTG 

ATATTTCTGT 

GAGCCCAGTA 

GCAGCTATAG 

TAACTGACAT 

TTTCAATACC 

120 

TCCGATGGTG 

GACGCTTCAA 

ATTCCCAGAC 

GGGGTACAAA 

ACTGGCCAGC 

ACTTTCAATC 

180 

GTCATCATAA 

TAATCATGAC 

AATAGGTGGC 

AACATCCTTG 

TGATCATGGC 

AGTAAGCATG 

240 

GAAAAGAAAC 

TGCACAATGC 

CACCAATTAC 

TTCTTAATGT 

CCCTAGCCAT 

TGCTGATATG 

300 

CTAGTGGGAC 

TACTTGTCAT 

GCCCCTGTCT 

CTCCTGGCAA 

TCCTTTATGA 

TTATGTCTGG 

360 

CCACTACCTA 

GATATTTGTG 

CCCCGTCTGG 

ATTTCTTTAG 

ATGTTTTATT 

TTCAACAGCG 

420 

TCCATCATGC 

ACCTCTGCGC 

TATATCGCTG 

GATCGGTATG 

TAGCAATACG 

TAATCCTATT 

480 

GAGCATAGCC 

GTTTCAATTC 

GCGGACTAAG 

GCCATCATGA 

AGATTGCTAT 

TGTTTGGGCA 

540 

ATTTCTATAG 

GTGTATCAGT 

TCCTATCCCT 

GTGATTGGAC 

TGAGGGACGA 

AGAAAAGGTG 

600 

TTCGTGAACA 

ACACGACGTG 

CGTGCTCAAC 

GACCCAAATT 

TCGTTCTTAT 

TGGGTCCTTC 

660 

GTAGCTTTCT 

TCATACCGCT 

GACGATTATG 

GTGATTACGT 

ATTGCCTGAC 

CATCTACGTT 

720 

CTGCGCCGAC 

AAGCTTTGAT 

GTTACTGCAC 

GGCCACACCG 

AGGAACCGCC 

TGGACTAAGT 

780 

CTGGATTTCC 

TGAAGTGCTG 

CAAGAGGAAT 

ACGGCCGAGG 

AAGAGAACTC 

TGCAAACCCT 

840 

AACCAAGACC 

AGAACGCACG 

CCGAAGAAAG 

AAGAAGGAGA 

GACGTCCTAG 

GGGCACCATG 

900 

CAGGCTATCA 

ACAATGAAAG 

AAAAGCTTCG 

AAAGTCCTTG 

GGATTGTTTT 

CTTTGTGTTT 

960 

CTGATCATGT 

GGTGCCCATT 

TTTCATTACC 

AATATTCTGT 

CTGTTCTTTG 

TGAGAAGTCC 

1020 

TGTAACCAAA 

AGCTCATGGA 

AAAGCTTCTG 

AATGTGTTTG 

TTTGGATTGG 

CTATGTTTGT 

1080 

TCAGGAATCA 

ATCCTCTGGT 

GTATACTCTG 

TTCAACAAAA 

TTTACCGAAG 

GGCATTCTCC 

1140 

AACTATTTGC 

GTTGCAATTA 

TAAGGTAGAG 

AAAAAGCCTC 

CTGTCAGGCA 

GATTCCAAGA 

1200 

GTTGCCGCCA 

CTGCTTTGTC 

TGGGAGGGAG 

CTTAATGTTA 

ACATTTATCG 

GCATACCAAT 

1260 

GAACCGGTGA 

TCGAGAAAGC 

CAGTGACAAT 

GAGCCCGGTA 

TAGAGATGCA 

AGTTGAGAAT 

1320 

TTAGAGTTAC 

CAGTAAATCC 

CTCCAGTGTG 

GTTAGCGAAA 

GGATTAGCAG 

TGTGTGA 

1377 


(28) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 


<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 


Met 

Val 

Asn 

Leu 

Arg 

Asn 

Ala 

Val 

His 

Ser 

Phe 

Leu 

Val 

His 

Leu 

lie 

1 




5 





10 





15 


Gly 

Leu 

Leu 

Val 

Trp 

Gin 

Cys 

Asp 

lie 

Ser 

Val 

Ser 

Pro 

Val 

Ala 

Ala 




20 





25 





30 



lie 

Val 

Thr 

Asp 

lie 

Phe 

Asn 

Thr 

Ser 

Asp 

Gly 

Gly 

Arg 

Phe 

Lys 

Phe 



35 





40 





45 




Pro 

Asp 

Gly Val 

Gin 

Asn 

Trp 

Pro 

Ala 

Leu 

Ser 

lie 

Val 

lie 

lie 

lie 


50 





55 





60 





He 

Met 

Thr 

lie 

Gly 

Gly 

Asn 

lie 

Leu 

Val 

lie 

Met 

Ala 

Val 

Ser 

Met 

65 





70 





75 





80 

Glu 

Lys 

Lys 

Leu 

His 

Asn 

Ala 

Thr 

Asn 

Tyr 

Phe 

Leu 

Met 

Ser 

Leu 

Ala 





85 





90 





95 


lie 

Ala 

Asp 

Met 

Leu 

Val 

Gly 

Leu 

Leu 

Val 

Met 

Pro 

Leu 

Ser 

Leu 

Leu 



100 





105 





110 



Ala 

lie 

Leu 

Tyr Asp 

Tyr 

Val 

Trp 

Pro 

Leu 

Pro 

Arg 

Tyr 

Leu 

Cys 

Pro 



115 





120 





125 




Val 

Trp 

lie 

Ser 

Leu 

Asp 

Val 

Leu 

Phe 

Ser 

Thr 

Ala 

ser 

lie 

Met 

His 


130 





135 





140 





Leu 

Cys 

Ala 

lie 

Ser 

Leu 

Asp Arg Tyr 

Val 

Ala 

lie 

Arg 

Asn 

Pro 

lie 

145 





150 





155 





160 

Glu 

His 

Ser Arg 

Phe 

Asn 

Ser Arg 

Thr 

Lys 

Ala 

lie 

Met 

Lys 

lie 

Ala 





165 





170 





175 


lie 

Val 

Trp Ala 

lie 

Ser 

lie 

Gly 

Val 

Ser 

Val 

Pro 

lie 

Pro 

Val 

lie 




180 





185 





190 



Gly 

Leu 

Arg Asp 

Glu 

Glu 

Lys 

Val 

Phe 

Val 

Asn 

Asn 

Thr 

Thr 

Cys 

Val 



195 





200 





205 




Leu 

Asn 

Asp 

Pro 

Asn 

Phe 

Val 

Leu 

lie 

Gly 

Ser 

Phe 

Val 

Ala 

Phe 

Phe 


210 




215 





220 





lie 

Pro 

Leu 

Thr 

lie 

Met 

Val 

lie 

Thr 

Tyr 

Cys 

Leu 

Thr 

lie 

Tyr 

Val 

225 





230 





235 





240 

Leu 

Arg 

Arg 

Gin 

Ala 

Leu 

Met 

Leu 

Leu 

His 

Gly 

His 

Thr 

Glu 

Glu 

Pro 




245 





250 





255 


Pro 

Gly 

Leu 

Ser 

Leu 

Asp 

Phe 

Leu 

Lys 

Cys 

Cys 

Lys 

Arg 

Asn 

Thr 

Ala 


260 265 270 
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Glu 

Glu 

Glu 

Asn 

Ser 

Ala 

Asn 

Pro 

Asn 

Gin 

Asp 

Gin 

Asn 

Ala 

Arg 

Arg 



275 





280 





285 




Arg 

Lys 

Lys 

Lys 

Glu 

Arg 

Arg 

Pro 

Arg 

Gly 

Thr 

Met 

Gin 

Ala 

lie 

Asn 


290 



295 





300 





Asn 

Glu 

Arg 

Lys 

Ala 

Ser 

Lys 

Val 

Leu 

Gly 

lie 

Val 

Phe 

Phe 

Val 

Phe 

305 



310 





315 





320 

Leu 

He 

Met 

Trp 

Cys 

Pro 

Phe 

Phe 

lie 

Thr 

Asn 

lie 

Leu 

Ser 

Val 

Leu 




325 





330 





335 


Cys 

Glu 

Lys 

Ser 

Cys 

Asn 

Gin 

Lys 

Leu 

Met 

Glu 

Lys 

Leu 

Leu 

Asn 

Val 


340 




345 





350 



Phe 

Val 

Trp 

lie 

Gly 

Tyr 

Val 

Cys 

Ser 

Gly 

lie 

Asn 

Pro 

Leu 

Val 

Tyr 



355 





360 





365 




Thr 

Leu 

Phe 

Asn 

Lys 

lie 

Tyr 

Arg 

Arg 

Ala 

Phe 

Ser 

Asn 

Tyr 

Leu 

Arg 


370 




375 





380 





Cys 

Asn 

Tyr 

Lys 

Val 

Glu 

Lys 

Lys 

Pro 

Pro 

Val 

Arg 

Gin 

lie 

Pro 

Arg 

385 



390 





395 





400 

Val 

Ala 

Ala 

Thr 

Ala 

Leu 

Ser 

Gly 

Arg 

Glu 

Leu 

Asn 

Val 

Asn 

lie 

Tyr 





405 





410 





415 


Arg 

His 

Thr 

Asn 

Glu 

Pro 

Val 

lie 

Glu 

Lys 

Ala 

Ser 

Asp 

Asn 

Glu 

Pro 



420 





425 





430 



Gly 

lie 

Glu 

Met 

Gin 

Val 

Glu 

Asn 

Leu 

Glu 

Leu 

Pro 

Val 

Asn 

Pro 

Ser 


435 





440 





445 




Ser 

Val 

Val 

Ser 

Glu 

Arg 

lie 

Ser 

Ser 

Val 








450 





455 










INFORMATION 

FOR 

SEQ 

id : 

NO:28: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 


ATGGTGAACC 

TGAGGAATGC 

GGTGCATTCA 

TTCCTTGTGC 

ACCTAATTGG 

CCTATTGGTT 

60 

TGGCAATGTG 

ATATTTCTGT 

GAGCCCAGTA 

GCAGCTATAG 

TAACTGACAT 

TTTCAATACC 

120 

TCCGATGGTG 

GACGCTTCAA 

ATTCCCAGAC 

GGGGTACAAA 

ACTGGCCAGC 

ACTTTCAATC 

180 

GTCATCATAA 

TAATCATGAC 

AATAGGTGGC 

AACATCCTTG 

TGATCATGGC 

AGTAAGCATG 

240 

GAAAAGAAAC 

TGCACAATGC 

CACCAATTAC 

TTCTTAATGT 

CCCTAGCCAT 

TGCTGATATG 

300 

CTAGTGGGAC 

TACTTGTCAT 

GCCCCTGTCT 

CTCCTGGCAA 

TCCTTTATGA 

TTATGTCTGG 

360 
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CCACTACCTA 

GATATTTGTG 

CCCCGTCTGG 

ATTTCTTTAG 

ATGTTTTATT 

TTCAACAGCG 

420 

TCCATCATGC 

ACCTCTGCGC 

TATATCGCTG 

GATCGGTATG 

TAGCAATACG 

TAATCCTATT 

480 

GAGCATAGCC 

GTTTCAATTC 

GCGGACTAAG 

GCCATCATGA 

AGATTGCTAT 

TGTTTGGGCA 

540 

ATTTCTATAG 

GTGTATCAGT 

TCCTATCCCT 

GTGATTGGAC 

TGAGGGACGA 

AGAAAAGGT G 

600 

TTCGTGAACA 

ACACGACGTG 

CGTGCTCAAC 

GACCCAAATT 

TCGTTCTTAT 

TGGGTCCTTC 

660 

GTAGCTTTCT 

TCATACCGCT 

GACGATTATG 

GTGATTACGT 

ATTGCCTGAC 

CATCTACGTT 

720 

CTGCGCCGAC 

AAGCTTTGAT 

GTTACTGCAC 

GGCCACACCG 

AGGAACCGCC 

TGGACTAAGT 

780 

CTGGATTTCC 

TGAAGTGCTG 

CAAGAGGAAT 

ACGGCCGAGG 

AAGAGAACTC 

TGCAAACCCT 

840 

AACCAAGACC 

AGAACGCACG 

CCGAAGAAAG 

AAGAAGGAGA 

GACGTCCTAG 

GGGCACCATG 

900 

CAGGCTATCA 

ACAATGAAAG 

AAAAGCTAAG 

AAAGTCCTTG 

GGATTGTTTT 

CTTTGTGTTT 

960 

CTGATCATGT 

GGTGCCCATT 

TTTCATTACC 

AATATTCTGT 

CTGTTCTTTG 

TGAGAAGTCC 

1020 

TGTAACCAAA 

AGCTCATGGA 

AAAGCTTCTG 

AATGTGTTTG 

TTTGGATTGG 

CTATGTTTGT 

1080 

TCAGGAATCA 

ATCCTCTGGT 

GTATACTCTG 

TTCAACAAAA 

TTTACCGAAG 

GGCATTCTCC 

1140 

AACTATTTGC 

GTTGCAATTA 

TAAGGTAGAG 

AAAAAGCCTC 

CTGTCAGGCA 

GATTCCAAGA 

1200 

GTTGCCGCCA 

CTGCTTTGTC 

TGGGAGGGAG 

CTTAATGTTA 

ACATTTATCG 

GCATACCAAT 

1260 

GAACCGGTGA 

TCGAGAAAGC 

CAGTGACAAT 

GAGCCCGGTA 

TAGAGATGCA AGTTGAGAAT 

1320 

TTAGAGTTAC 

CAGTAAATCC 

CTCCAGTGTG 

GTTAGCGAAA 

GGATTAGCAG 

TGTGTGA 

1377 


(30) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 


Met 

1 

Val 

Asn 

Leu 

Arg Asn 
5 

Ala 

Val 

His 

Ser 

10 

Phe 

Leu Val 

His 

Leu 

15 

lie 

Gly 

Leu 

Leu 

Val 

20 

Trp 

Gin 

Cys 

Asp 

lie 

25 

Ser 

Val 

Ser Pro 

Val 

30 

Ala 

Ala 

He 

Val 

Thr 

35 

Asp 

lie 

Phe 

Asn 

Thr 

40 

Ser 

Asp 

Gly 

Gly Arg 
45 

Phe 

Lys 

Phe 

Pro 

Asp 

50 

Gly 

Val 

Gin 

Asn Trp 
55 

Pro 

Ala 

Leu 

Ser 

lie Val 
60 

lie 

lie 

lie 

lie 

Met 

Thr 

lie 

Gly 

Gly Asn 

lie 

Leu 

Val 

lie 

Met Ala 

Val 

Ser 

Met 
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65 





70 




Glu 

Lys 

Lys 

Leu 

His 

Asn 

Ala 

Thr 

Asn 





85 





lie- 

Ala 

Asp 

Met 

Leu 

Val 

Gly 

Leu 

Leu 




100 





105 

Ala 

lie 

Leu 

Tyr 

Asp 

Tyr 

Val 

Trp 

Pro 



115 





120 


Val 

Trp 

lie 

Ser 

Leu Asp 

Val 

Leu 

Phe 


130 





135 



Leu 

Cys 

Ala 

He 

Ser 

Leu 

Asp 

Arg 

Tyr 

145 





150 




Glu 

His 

Ser 

Arg 

Phe 

Asn 

Ser 

Arg 

Thr 





165 





He 

Val 

Trp Ala 

lie 

Ser 

lie 

Gly 

Val 




180 





185 

Gly 

Leu 

Arg 

Asp 

Glu 

Glu 

Lys 

Val 

Phe 



195 





200 


Leu 

Asn 

Asp 

Pro 

Asn 

Phe 

Val 

Leu 

lie 


210 





215 



lie 

Pro 

Leu 

Thr 

lie 

Met 

Val 

lie 

Thr 

225 





230 




Leu 

Arg 

Arg 

Gin 

Ala 

Leu 

Met 

Leu 

Leu 





245 





Pro 

Gly 

Leu 

Ser 

Leu 

Asp 

Phe 

Leu 

Lys 




260 





265 

Glu 

Glu 

Glu 

Asn 

Ser 

Ala 

Asn 

Pro 

Asn 



275 





280 


Arg 

Lys 

Lys 

Lys 

Glu Arg 

Arg 

Pro 

Arg 


290 





295 



Asn 

Glu 

Arg 

Lys 

Ala 

Lys 

Lys 

Val 

Leu 

305 





310 




Leu 

lie 

Met 

Trp 

Cys 

Pro 

Phe 

Phe 

lie 





325 





Cys 

Glu 

Lys 

Ser 

Cys 

Asn 

Gin 

Lys 

Leu 




340 





345 

Phe 

Val 

Trp 

lie 

Gly Tyr 

Val 

Cys 

Ser 



355 





360 


Thr 

Leu 

Phe 

Asn 

Lys 

lie 

Tyr 

Arg 

Arg 


370 





375 



Cys 

Asn 

Tyr 

Lys 

Val 

Glu 

Lys 

Lys 

Pro 


385 390 
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75 





80 

Tyr 

Phe 

Leu 

Met 

Ser 

Leu 

Ala 

90 





95 


Val 

Met 

Pro 

Leu 

Ser 

Leu 

Leu 





110 



Leu 

Pro 

Arg 

Tyr 

Leu 

Cys 

Pro 




125 




Ser 

Thr 

Ala 

Ser 

lie 

Met 

His 



140 





Val 

Ala 

lie 

Arg Asn 

Pro 

lie 


155 





160 

Lys 

Ala 

lie 

Met 

Lys 

lie 

Ala 

170 





175 


Ser 

Val 

Pro 

lie 

Pro 

Val 

lie 





190 



Val 

Asn 

Asn 

Thr 

Thr 

Cys 

Val 




205 




Gly 

Ser 

Phe 

Val 

Ala 

Phe 

Phe 



220 





Tyr 

Cys 

Leu 

Thr 

lie 

Tyr 

Val 


235 





240 

His 

Gly 

His 

Thr 

Glu 

Glu 

Pro 

250 





255 


Cys 

Cys 

Lys 

Arg 

Asn 

Thr 

Ala 





270 



Gin 

Asp 

Gin 

Asn 

Ala Arg 

Arg 




285 




Gly 

Thr 

Met 

Gin 

Ala 

lie 

Asn 


300 





Gly 

lie 

Val 

Phe 

Phe 

Val 

Phe 


315 





320 

Thr 

Asn 

lie 

Leu 

Ser 

Val 

Leu 

330 





335 


Met 

Glu 

Lys 

Leu 

Leu 

Asn 

Val 




350 



Gly 

lie 

Asn 

Pro 

Leu 

Val 

Tyr 




365 




Ala 

Phe 

Ser 

Asn 

Tyr 

Leu 

Arg 



380 





Pro 

Val 

Arg 

Gin 

He 

Pro 

Arg 


395 





400 
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Val 

Ala 

Ala 

Thr 

Ala 

405 

Leu 

Ser 

Gly 

Arg 

Glu 

410 

Leu 

Asn 

Val 

Asn 

lie 

415 

Tyr 

Arg 

His 

Thr 

Asn 

420 

Glu 

Pro 

Val 

lie 

Glu 

425 

Lys 

Ala 

Ser 

Asp 

Asn 

430 

Glu 

Pro 

Gly 

He 

Glu 

435 

Met 

Gin 

Val 

Glu 

Asn 

440 

Leu 

Glu 

Leu 

Pro 

Val 

445 

Asn 

Pro 

Ser 

Ser 

Val 

Val 

Ser 

Glu 

Arg 

lie 

Ser 

Ser 

Val 








450 455 

(31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 


ATGGATATTC 

TTTGTGAAGA 

AAATACTTCT 

TTGAGCTCAA 

CTACGAACTC 

CCTAATGCAA 

TTAAATGATG 

ACAACAGGCT 

CTACAGTAAT 

GACTTTAACT 

CCGGAGAAGC 

T AACACTT CT 

GATGCATTTA 

ACTGGACAGT 

CGACTCTGAA 

AATCGAACCA 

ACCTTTCCTG 

TGAAGGGTGC 

CTCTCACCGT 

CGTGTCTCTC 

CTTACTTCAT 

CTCCAGGAAA 

AAAACTGGTC 

TGCTTTACTG 

ACAGCCGTAG 

TGATTATTCT 

AACTATTGCT 

GGAAACATAC 

TCGTCATCAT 

GGCAGTGTCC 

CTAGAGAAAA 

AGCTGCAGAA 

TGCCACCAAC 

TATTTCCTGA 

TGTCACTTGC 

CATAGCTGAT 

ATGCTGCTGG 

GTTTCCTTGT 

CATGCCCGTG 

TCCATGTTAA 

CCATCCTGTA 

TGGGTACCGG 

TGGCCTCTGC 

CGAGCAAGCT 

TTGTGCAGTC 

TGGATTTACC 

TGGACGTGCT 

CTTCTCCACG 

GCCTCCATCA 

TGCACCTCTG 

CGCCATCTCG 

CTGGACCGCT 

ACGTCGCCAT 

CCAGAATCCC 

ATCCACCACA 

GCCGCTTCAA 

CTCCAGAACT 

AAGGCATTTC 

TGAAAATCAT 

TGCTGTTTGG 

ACCATATCAG 

TAGGTATATC 

CATGCCAATA 

CCAGTCTTTG 

GGCTACAGGA 

CGATTCGAAG 

GTCTTTAAGG 

AGGGGAGTTG 

CTTACTCGCC 

GATGATAACT 

TTGTCCTGAT 

CGGCTCTTTT 

GTGTCATTTT 

TCATTCCCTT 

AACCATCATG 

GTGATCACCT 

ACTTTCTAAC 

TATCAAGGTT 

CTGCGCCGAC 

AAGCTTTGAT 

GTTACTGCAC 

GGCCACACCG 

AGGAACCGCC 

TGGACTAAGT 

CTGGATTTCC 

TGAAGTGCTG 

CAAGAGGAAT 

ACGGCCGAGG 

AAGAGAACTC 

TGCAAACCCT 

AACCAAGACC 

AGAACGCACG 

CCGAAGAAAG 

AAGAAGGAGA 

GACGTCCTAG 

GGGCACCATG 

CAGGCTATCA 

ACAATGAAAG 

AAAAGCTTCG 

AAGGTACTGG 

GCATCGTCTT 

CTTCCTGTTT 

GTGGTGATGT 

GGTGCCCTTT 

CTTCATCACA 

AACATCATGG 

CCGTCATCTG 

CAAAGAGTCC 
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720 

780 

840 

900 

960 

1020 

1080 
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TGCAATGAGG ATGTCATTGG GGCCCTGCTC AATGTGTTTG TTTGGATCGG TTATCTCTCT 1140 
TCAGCAGTCA ACCCACTAGT CTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1200 
AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1260 
GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG G CAT AC CAAT 132 0 
GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1380 
TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1437 


(32) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID. NO:31: 


Met 

1 

Asp 

lie 

Leu 

Cys 

5 

Glu 

Glu 

Asn 

Thr 

Ser 

10 

Leu 

Ser 

Ser 

Thr 

Thr 

15 

Asn 

Ser 

Leu 

Met 

Gin 

20 

Leu 

Asn Asp 

Asp 

Asn 

25 

Arg 

Leu 

Tyr 

Ser 

Asn 

30 

Asp 

Phe 

Asn 

Ser 

Gly 

35 

Glu 

Ala 

Asn 

Thr 

Ser Asp 
40 

Ala 

Phe 

Asn 

Trp 

45 

Thr 

Val Asp 

Ser 

Glu 

50 

Asn Arg 

Thr 

Asn 

Leu 

55 

Ser 

Cys 

Glu 

Gly 

Cys 

60 

Leu 

Ser 

Pro 

Ser 

Cys 

65 

Leu 

Ser 

Leu 

Leu 

His 

70 

Leu 

Gin 

Glu 

Lys 

Asn 

75 

Trp 

Ser 

Ala 

Leu 

Leu 

80 

Thr 

Ala 

Val 

Val 

lie 

85 

lie 

Leu 

Thr 

lie 

Ala 

90 

Gly 

Asn 

lie 

Leu 

Val 

95 

lie 

Met 

Ala 

Val 

Ser 

100 

Leu 

Glu 

Lys 

Lys 

Leu 

105 

Gin 

Asn 

Ala 

Thr 

Asn 

110 

Tyr 

Phe 

Leu 

Met 

Ser 

115 

Leu 

Ala 

lie 

Ala 

Asp 

120 

Met 

Leu 

Leu 

Gly 

Phe 

125 

Leu 

Val 

Met 

Pro 

Val 

130 

Ser 

Met 

Leu 

Thr 

lie 

135 

Leu 

Tyr 

Gly Tyr 

Arg Trp 
140 

Pro 

Leu 

Pro 

Ser 

145 

Lys 

Leu 

Cys 

Ala 

Val 

150 

Trp 

lie 

Tyr 

Leu 

Asp 

155 

Val 

Leu 

Phe 

Ser 

Thr 

160 

Ala 

Ser 

lie 

Met 

His 

165 

Leu 

Cys 

Ala 

He 

Ser 

170 

Leu 

Asp Arg 

Tyr 

Val 

175 

Ala 

He 

Gin 

Asn 

Pro 

180 

lie 

His 

His 

Ser Arg 
185 

Phe 

Asn 

Ser 

Arg 

Thr 

190 

Lys 

Ala 
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Phe 

Leu 

Lys 

lie 

lie 

Ala 

Val 

Trp 

Thr 

lie 

Ser 

Val 

Gly 

lie 

Ser 

Met 



195 





200 





205 




Pro 

He 

Pro 

Val 

Phe 

Gly 

Leu 

Gin 

Asp 

Asp 

Ser 

Lys 

Val 

Phe 

Lys 

Glu 


210 





215 





220 





Gly 

Ser 

Cys 

Leu 

Leu 

Ala 

Asp 

Asp 

Asn 

Phe 

Val 

Leu 

lie 

Gly 

Ser 

Phe 

225 





230 





235 





240 

Val 

Ser 

Phe 

Phe 

lie 

Pro 

Leu 

Thr 

He 

Met 

Val 

lie 

Thr 

Tyr 

Phe 

Leu 





245 





250 





255 


Thr 

lie 

Lys 

Val 

Leu 

Arg 

Arg 

Gin 

Ala 

Leu 

Met 

Leu 

Leu 

His 

Gly His 




260 





265 





270 



Thr 

Glu 

Glu 

Pro 

Pro 

Gly 

Leu 

Ser 

Leu 

Asp 

Phe 

Leu 

Lys 

Cys 

Cys 

Lys 



275 





280 





285 




Arg 

Asn 

Thr 

Ala 

Glu 

Glu 

Glu 

Asn 

Ser 

Ala 

Asn 

Pro 

Asn 

Gin 

Asp 

Gin 


290 





295 





300 




Asn 

Ala 

Arg 

Arg 

Arg 

Lys 

Lys 

Lys 

Glu 

Arg Arg 

Pro 

Arg 

Gly 

Thr 

Met 

305 





310 





315 





320 

Gin 

Ala 

lie 

Asn 

Asn 

Glu 

Arg 

Lys 

Ala 

Ser 

Lys 

Val 

Leu 

Gly 

lie 

Val 





325 





330 





335 


Phe 

Phe 

Leu 

Phe 

Val 

Val 

Met 

Trp 

Cys 

Pro 

Phe 

Phe 

lie 

Thr 

Asn 

lie 




340 





345 





350 



Met 

Ala 

Val 

lie 

Cys 

Lys 

Glu 

Ser 

Cys 

Asn 

Glu Asp 

Val 

lie 

Gly Ala 



355 





360 





365 




Leu 

Leu 

Asn 

Val 

Phe 

Val 

Trp 

lie 

Gly 

Tyr 

Leu 

Ser 

Ser 

Ala 

Val 

Asn 


370 





375 




380 





Pro 

Leu 

Val 

Tyr 

Thr 

Leu 

Phe 

Asn 

Lys 

lie 

Tyr Arg 

Arg Ala 

Phe 

Ser 

385 





390 





395 





400 

Asn 

Tyr 

Leu 

Arg 

Cys 

Asn 

Tyr 

Lys 

Val 

Glu 

Lys 

Lys 

Pro 

Pro 

Val Arg 





405 





410 





415 


Gin 

lie 

Pro 

Arg 

Val 

Ala 

Ala 

Thr 

Ala 

Leu 

Ser 

Gly 

Arg 

Glu 

Leu 

Asn 




420 





425 





430 



Val 

Asn 

lie 

Tyr Arg 

His 

Thr 

Asn 

Glu 

Pro 

Val 

lie 

Glu 

Lys 

Ala 

Ser 



435 





440 





445 




Asp 

Asn 

Glu 

Pro 

Gly 

lie 

Glu 

Met 

Gin 

Val 

Glu 

Asn 

Leu 

Glu 

Leu 

Pro 


450 





455 





460 





Val 

Asn 

Pro 

Ser 

Ser 

Val 

Val 

Ser 

Glu 

Arg 

lie 

Ser 

Ser 

Val 




465 470 475 

(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


18 


SUBSTITUTE SHEET (RULE 26) 



WO 99/52927 


PCT/US99/08168 


(ii) MOLECULE TYPE: DNA (genomic) 


(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 32: 


ATGGATATTC 

TTTGTGAAGA 

AAATACTTCT 

TTGAGCTCAA 

CTACGAACTC 

CCTAATGCAA 

60 

TTAAATGATG 

ACAACAGGCT 

CTACAGTAAT 

GACTTTAACT 

CCGGAGAAGC 

TAACACTTCT 

120 

GATGCATTTA 

ACTGGACAGT 

CGACTCTGAA 

AATCGAACCA 

ACCTTTCCTG 

TGAAGGGTGC 

180 

CTCTCACCGT 

CGTGTCTCTC 

CTTACTTCAT 

CTCCAGGAAA 

AAAACTGGTC 

TGCTTTACTG 

240 

ACAGCCGTAG 

TGATTATTCT 

AACTATTGCT 

GGAAACATAC 

TCGTCATCAT 

GGCAGTGTCC 

300 

CTAGAGAAAA 

AGCTGCAGAA 

TGCCACCAAC 

TATTTCCTGA 

TGTCACTTGC 

CATAGCTGAT 

360 

ATGCTGCTGG 

GTTTCCTTGT 

CATGCCCGTG 

TCCATGTTAA 

CCATCCTGTA 

TGGGTACCGG 

420 

TGGCCTCTGC 

CGAGCAAGCT 

TTGTGCAGTC 

TGGATTTACC 

TGGACGTGCT 

CTTCTCCACG 

480 

GCCTCCATCA 

TGCACCTCTG 

CGCCATCTCG 

CTGGACCGCT 

ACGTCGCCAT 

CCAGAATCCC 

540 

ATCCACCACA 

GCCGCTTCAA 

CTCCAGAACT 

AAGGCATTTC 

TGAAAATCAT 

TGCTGTTTGG 

600 

ACCATATCAG 

TAGGTATATC 

CATGCCAATA 

CCAGTCTTTG 

GGCTACAGGA 

CGATTCGAAG 

660 

GTCTTTAAGG 

AGGGGAGTTG 

CTTACTCGCC 

GATGATAACT 

TTGTCCTGAT 

CGGCTCTTTT 

720 

GTGTCATTTT 

TCATTCCCCT 

GACGATTATG 

GTGATTACGT 

ATTGCCTGAC 

CATCTACGTT 

780 

CTGCGCCGAC 

AAGCTTTGAT 

GTTACTGCAC 

GGCCACACCG 

AGGAACCGCC 

TGGACTAAGT 

840 

CTGGATTTCC 

TGAAGTGCTG 

CAAGAGGAAT 

ACGGCCGAGG 

AAGAGAACT C 

TGCAAACCCT 

900 

AACCAAGACC 

AGAACGCACG 

CCGAAGAAAG 

AAGAAGGAGA 

GACGTCCTAG 

GGGCACCATG 

960 

CAGGCTATCA 

ACAATGAAAG 

AAAAGCTAAG 

AAAGTCCTTG 

GGATTGTTTT 

CTTTGTGTTT 

1020 

CTGATCATGT 

GGTGCCCTTT 

CTTCATCACA 

AACATCATGG 

CCGTCATCTG 

CAAAGAGTCC 

1080 

TGCAATGAGG 

ATGTCATTGG 

GGCCCTGCTC 

AATGTGTTTG 

TTTGGATCGG 

TTATCTCTCT 

1140 

TCAGCAGTCA 

ACCCACTAGT 

CTATACTCTG 

TTCAACAAAA 

TTTACCGAAG 

GGCATTCTCC 

1200 

AACTATTTGC 

GTTGCAATTA 

TAAGGTAGAG 

AAAAAGCCTC 

CTGTCAGGCA 

GATTCCAAGA 

1260 

GTTGCCGCCA 

CTGCTTTGTC 

TGGGAGGGAG 

CTTAATGTTA 

ACATTTATCG 

GCATACCAAT 

1320 

GAACCGGTGA 

TCGAGAAAGC 

CAGTGACAAT 

GAGCCCGGTA 

TAGAGATGCA 

AGTTGAGAAT 

1380 

TTAGAGTTAC 

CAGTAAATCC 

CTCCAGTGTG 

GTTAGCGAAA 

GGATTAGCAG 

TGTGTGA 

1437 


(34) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: protein 


(xi)- SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Asp lie Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn 
15 10 15 

Ser Leu Met Gin Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe 
20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
50 55 60 

Cys Leu Ser Leu Leu His Leu Gin Glu Lys Asn Trp Ser Ala Leu Leu 

65 70 75 80 

Thr Ala Val Val lie lie Leu Thr lie Ala Gly Asn lie Leu Val lie 

85 90 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gin Asn Ala Thr Asn Tyr Phe 
100 105 110 

Leu Met Ser Leu Ala lie Ala Asp Met Leu Leu Gly Phe Leu Val Met 
115 120 125 

Pro Val Ser Met Leu Thr He Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 
130 135 140 

Ser Lys Leu Cys Ala Val Trp lie Tyr Leu Asp Val Leu Phe Ser Thr 

145 150 155 160 

Ala Ser lie Met His Leu Cys Ala lie Ser Leu Asp Arg Tyr Val Ala 

165 170 175 

lie Gin Asn Pro lie His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 
180 185 190 

Phe Leu Lys lie lie Ala Val Trp Thr lie Ser Val Gly lie Ser Met 
195 200 205 

Pro lie Pro Val Phe Gly Leu Gin Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu lie Gly Ser Phe 

225 230 235 240 

Val Ser Phe Phe lie Pro Leu Thr lie Met Val lie Thr Tyr Cys Leu 

245 250 255 

Thr lie Tyr Val Leu Arg Arg Gin Ala Leu Met Leu Leu His Gly His 
260 265 270 

Thr Glu Glu Pro Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys 
275 280 285 

Arg Asn Thr Ala Glu Glu Glu Asn Ser Ala Asn Pro Asn Gin Asp Gin 
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290 295 

Asn Ala Arg Arg Arg Lys Lys Lys 
305 310 

Gin- Ala lie Asn Asn Glu Arg Lys 
325 

Phe Phe Val Phe Leu He Met Trp 
340 

Met Ala Val lie Cys Lys Glu Ser 
355 360 

Leu Leu Asn Val Phe Val Trp lie 
370 375 

Pro Leu Val Tyr Thr Leu Phe Asn 
385 390 

Asn Tyr Leu Arg Cys Asn Tyr Lys 
405 

Gin lie Pro Arg Val Ala Ala Thr 
420 

Val Asn lie Tyr Arg His Thr Asn 
435 440 

Asp Asn Glu Pro Gly lie Glu Met 
450 455 

Val Asn Pro Ser Ser Val Val Ser 
465 470 


300 

Glu Arg Arg Pro Arg Gly Thr Met 
315 320 

Ala Lys Lys Val Leu Gly lie Val 
330 335 

Cys Pro Phe Phe lie Thr Asn lie 
345 350 

Cys Asn Glu Asp Val lie Gly Ala 
365 

Gly Tyr Leu Ser Ser Ala Val Asn 
380 

Lys lie Tyr Arg Arg Ala Phe Ser 
395 400 

Val Glu Lys Lys Pro Pro Val Arg 
410 415 

Ala Leu Ser Gly Arg Glu Leu Asn 
425 430 

Glu Pro Val lie Glu Lys Ala Ser 
445 

Gin Val Glu Asn Leu Glu Leu Pro 
460 

Glu Arg lie Ser Ser Val 
475 
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